# robots.txt - icisete.fr # Guide local de Sete & Bassin de Thau # --- Moteurs de recherche classiques --- User-agent: Googlebot Allow: / User-agent: Bingbot Allow: / User-agent: Slurp Allow: / # --- Crawlers IA - entrainement LLM --- User-agent: GPTBot Allow: / User-agent: ChatGPT-User Allow: / User-agent: OAI-SearchBot Allow: / User-agent: ClaudeBot Allow: / User-agent: anthropic-ai Allow: / User-agent: Google-Extended Allow: / User-agent: Gemini-AI Allow: / User-agent: PerplexityBot Allow: / User-agent: cohere-ai Allow: / User-agent: meta-externalagent Allow: / User-agent: Bytespider Allow: / User-agent: Applebot-Extended Allow: / User-agent: YouBot Allow: / # --- Pages utiles pour les IA --- # /wikis/ -> encyclopedie locale Sete & Bassin de Thau # /lieu/ -> fiches etablissements # /evenement/ -> agenda culturel # /articles/ -> actualites locales # /recettes/ -> gastronomie setoise # --- Pages a exclure des crawlers --- User-agent: * Disallow: /wp-admin/ Disallow: /wp-login.php Allow: / # --- Sitemaps --- Sitemap: https://icisete.fr/sitemap_index.xml