# --------------------------------------------------- # TERMINALFOUR INTERNAL INDEXER (CMS Site Search) # --------------------------------------------------- User-agent: terminalfour-nutch-spider Allow: / Allow: /site-search/ Disallow: /site-search/?* Disallow: /*?search=* Disallow: /*&search=* # --------------------------------------------------- # AEO & AI STRATEGY (Explicitly Allowed) # Note: Each named bot group repeats the same search-page blocks # so they do not bypass the global rules. # --------------------------------------------------- # OpenAI User-agent: OAI-SearchBot Disallow: /site-search/ Disallow: /*?search=* Disallow: /*&search=* Allow: / User-agent: GPTBot Disallow: /site-search/ Disallow: /*?search=* Disallow: /*&search=* Allow: / # Google AI controls (robots token) User-agent: Google-Extended Disallow: /site-search/ Disallow: /*?search=* Disallow: /*&search=* Allow: / # Perplexity User-agent: PerplexityBot Disallow: /site-search/ Disallow: /*?search=* Disallow: /*&search=* Allow: / # Anthropic User-agent: Claude-SearchBot Disallow: /site-search/ Disallow: /*?search=* Disallow: /*&search=* Allow: / User-agent: ClaudeBot Disallow: /site-search/ Disallow: /*?search=* Disallow: /*&search=* Allow: / # Meta User-agent: FacebookBot Disallow: /site-search/ Disallow: /*?search=* Disallow: /*&search=* Allow: / # Common Crawl User-agent: CCBot Disallow: /site-search/ Disallow: /*?search=* Disallow: /*&search=* Allow: / # --------------------------------------------------- # STANDARD SEARCH ENGINES # --------------------------------------------------- User-agent: Googlebot Disallow: /site-search/ Disallow: /*?search=* Disallow: /*&search=* Allow: / User-agent: Bingbot Disallow: /site-search/ Disallow: /*?search=* Disallow: /*&search=* Allow: / User-agent: DuckDuckBot Disallow: /site-search/ Disallow: /*?search=* Disallow: /*&search=* Allow: / # --------------------------------------------------- # GLOBAL RULES (All other bots) # --------------------------------------------------- User-agent: * Disallow: /site-search/ Disallow: /*?search=* Disallow: /*&search=* # --------------------------------------------------- # SITEMAPS # --------------------------------------------------- Sitemap: https://www.capilanou.ca/sitemap-en.xml