# Last Updated SM 18/08/2025

User-Agent: *
User-agent: SEOTestingBot/1.0.0.0
Allow: /

User-agent: Mediapartners-Google*
Allow: /

User-agent: Googlebot-Image
Allow: /wp-content/uploads/

User-agent: Adsbot-Google
Allow: /

User-agent: Googlebot-Mobile
Allow: /
# Allow: /?display=wide
Allow: /wp-content/uploads/

# ——— OPENAI ———
# Search (shows my webpages as links inside ChatGPT search). NOT used for model training.
User-agent: OAI-SearchBot
Allow: /

# User-driven browsing from ChatGPT and Custom GPTs. Acts after a human click.
User-agent: ChatGPT-User
User-agent: ChatGPT-User/2.0
Allow: /

# Model-training crawler. Opt-out here if I don’t want content in GPT-4o or GPT-5.
User-agent: GPTBot
Disallow: /private/ # example private folder
Allow: / # everything else

# ——— ANTHROPIC (Claude) ———
User-agent: anthropic-ai # bulk model training
Allow: /

User-agent: ClaudeBot # chat citation fetch
User-agent: claude-web # web-focused crawl
Allow: /

# ——— PERPLEXITY ———
User-agent: PerplexityBot # index builder
Allow: /

User-agent: Perplexity-User # human-triggered visit
Allow: /

# ——— GOOGLE (Gemini) ———
User-agent: Google-Extended
Allow: /

# ——— MICROSOFT (Bing / Copilot) ———
User-agent: BingBot
Allow: /

# ——— AMAZON ———
User-agent: Amazonbot
Allow: /

# ——— APPLE ———
User-agent: Applebot
User-agent: Applebot-Extended
Allow: /

# ——— META ———
User-agent: FacebookBot
User-agent: meta-externalagent
Allow: /

# ——— LINKEDIN ———
User-agent: LinkedInBot
Allow: /

# ——— BYTEDANCE ———
User-agent: Bytespider
Allow: /

# ——— DUCKDUCKGO ———
User-agent: DuckAssistBot
Allow: /

# ——— COHERE ———
User-agent: cohere-ai
Allow: /

# ——— ALLEN INSTITUTE / COMMON CRAWL / OTHER RESEARCH ———
User-agent: AI2Bot
User-agent: CCBot
User-agent: Diffbot
User-agent: omgili
Allow: /

# ——— EMERGING SEARCH START-UPS ———
User-agent: TimpiBot
User-agent: YouBot
Allow: /
Disallow: /thanks/
Disallow: /thanks-much/
Disallow: /html/*
Disallow: /images/*
Disallow: /wp-content/updraft/*
Disallow: /wp-content/wflogs/*
Disallow: /wp-admin/*
Disallow: /wp-json/
Disallow: /wp-includes/*
Disallow: /wp-content/plugins/*
Disallow: /wp-content/cache/*
Disallow: /wp-register.php
Disallow: /wp-login/
Disallow: /wp-content/themes/*
Disallow: /jquery/*
Disallow: /sites/all/*
Disallow: /calculators/common/js/
Disallow: /search/
Disallow: /readme.html
Disallow: /wp-trackback.php
Disallow: /xmlrpc.php
# separate directive for the main script file of WP
Disallow: /*.php$
Disallow: /*.inc$
Disallow: /?p=*
Disallow: /*?
Disallow: /*=

# Sitemaps
Sitemap: https://www.buildingsguide.com/sitemap.xml