# HOOBOX — robots.txt # Default rule User-agent: * Allow: / Disallow: /admin Disallow: /preview Disallow: /obrigado Disallow: /thank-you # ==================================== # AI Crawlers — explicitly allowed # ==================================== # OpenAI / ChatGPT (training) User-agent: GPTBot Allow: / # OpenAI / ChatGPT Search (real-time) User-agent: OAI-SearchBot Allow: / # OpenAI / ChatGPT user-agent (browsing) User-agent: ChatGPT-User Allow: / # Anthropic / Claude User-agent: ClaudeBot Allow: / User-agent: anthropic-ai Allow: / User-agent: Claude-Web Allow: / # Perplexity User-agent: PerplexityBot Allow: / User-agent: Perplexity-User Allow: / # Google AI Overviews / Gemini User-agent: Google-Extended Allow: / # Apple Intelligence User-agent: Applebot-Extended Allow: / # Common Crawl (used by many LLMs) User-agent: CCBot Allow: / # Meta AI / Llama User-agent: FacebookBot Allow: / User-agent: Meta-ExternalAgent Allow: / # You.com User-agent: YouBot Allow: / # ByteDance / Doubao User-agent: Bytespider Allow: / # Mistral AI User-agent: MistralAI-User Allow: / # DuckDuckGo AI User-agent: DuckAssistBot Allow: / # ==================================== # Sitemap # ==================================== Sitemap: https://hoobox.one/sitemap.xml