# AI Search & Assistant Bots (Citation/Real-time Use) User-agent: OAI-SearchBot Allow: / User-agent: ChatGPT-User Allow: / User-agent: PerplexityBot Allow: / User-agent: Perplexity-User Allow: / User-agent: ClaudeBot Allow: / User-agent: claude-web Allow: / User-agent: FirecrawlAgent Allow: / User-agent: AndiBot Allow: / User-agent: PhindBot Allow: / User-agent: YouBot Allow: / User-agent: Applebot Allow: / User-agent: Applebot-Extended Allow: / # AI Training Data Collection Bots User-agent: GPTBot Allow: / User-agent: anthropic-ai Allow: / User-agent: CCBot Allow: / User-agent: Google-Extended Allow: / User-agent: GoogleOther Allow: / User-agent: img2dataset Allow: / User-agent: AI2Bot Allow: / User-agent: Ai2Bot-Dolma Allow: / User-agent: Omgilibot Allow: / User-agent: Omgili Allow: / User-agent: magpie-crawler Allow: / User-agent: cohere-ai Allow: / User-agent: Diffbot Allow: / User-agent: Meta-ExternalAgent Allow: / User-agent: FacebookBot Allow: / User-agent: Bytespider Allow: / # Traditional Search Engines User-agent: Googlebot Allow: / User-agent: Bingbot Allow: / User-agent: DuckDuckBot Allow: / User-agent: YandexBot Allow: / User-agent: Baiduspider Allow: / User-agent: Sogou web spider Allow: / # Everything else User-agent: * Allow: / Sitemap: https://aloware.com/sitemap.xml