#robots de Quo.es #es necesario personalizar algunas opciones o puede dar problemas # Bloqueo basico para todos los bots y crawlers # puede dar problemas por bloqueo de recursos en GWT User-agent: * User-agent: Googlebot User-agent: Googlebot-News User-agent: Googlebot-Image User-agent: Googlebot-Video Allow: /wp-content/uploads/* Allow: /wp-content/*.js Allow: /wp-content/*.css Allow: /wp-includes/*.js Allow: /wp-includes/*.css Allow: /*.css$ Allow: /*.js$ # === Bots de IA generativa (LLMs, asistentes y buscadores) === User-agent: GPTBot User-agent: ChatGPT-User User-agent: OAI-SearchBot User-agent: ClaudeBot User-agent: Claude-SearchBot User-agent: Claude-User User-agent: PerplexityBot User-agent: Perplexity-User User-agent: PhindBot User-agent: SageBot User-agent: AndiBot User-agent: Quora-Bot User-agent: ExaBot User-agent: DuckAssistBot User-agent: Google-Extended User-agent: Applebot-Extended User-agent: Facebookbot User-agent: Meta-ExternalAgent User-agent: Meta-ExternalFetcher Allow: / Disallow: /cgi-bin Disallow: /app Disallow: /app28392830482 Disallow: /vendor Disallow: /api/* Disallow: /api Disallow: /*/attachment/ Disallow: /tag/*/page/ Disallow: /tag/*/feed/ Disallow: /page/ Disallow: /comments/ Disallow: /xmlrpc.php Disallow: /?attachment_id* Disallow: /*/feed/ # Previene problemas de recursos bloqueados en Google Webmaster Tools #Bloqueo de busquedas User-agent: * Disallow: /?s= Disallow: /search # Bloqueo de trackbacks User-agent: * Disallow: /trackback Disallow: /*trackback Disallow: /*trackback* Disallow: /*/trackback # Bloqueo de feeds para crawlers User-agent: * Allow: /feed/$ Disallow: /feed/ Disallow: /comments/feed/ Disallow: /*/feed/$ Disallow: /*/feed/rss/$ Disallow: /*/trackback/$ Disallow: /*/*/feed/$ Disallow: /*/*/feed/rss/$ Disallow: /*/*/trackback/$ Disallow: /*/*/*/feed/$ Disallow: /*/*/*/feed/rss/$ Disallow: /*/*/*/trackback/$ Disallow: /rss/* Disallow: /rss # Ralentizamos algunos bots que se suelen volver locos User-agent: noxtrumbot Crawl-delay: 20 User-agent: msnbot Crawl-delay: 20 User-agent: Slurp Crawl-delay: 20 # Bloqueo de bots y crawlers poco utiles User-agent: MSIECrawler Disallow: / User-agent: WebCopier Disallow: / User-agent: HTTrack Disallow: / User-agent: Microsoft.URL.Control Disallow: / User-agent: libwww Disallow: / User-agent: Orthogaffe Disallow: / User-agent: UbiCrawler Disallow: / User-agent: DOC Disallow: / User-agent: Zao Disallow: / User-agent: sitecheck.internetseer.com Disallow: / User-agent: Zealbot Disallow: / User-agent: MSIECrawler Disallow: / User-agent: SiteSnagger Disallow: / User-agent: WebStripper Disallow: / User-agent: WebCopier Disallow: / User-agent: Fetch Disallow: / User-agent: Offline Explorer Disallow: / User-agent: Teleport Disallow: / User-agent: TeleportPro Disallow: / User-agent: WebZIP Disallow: / User-agent: linko Disallow: / User-agent: HTTrack Disallow: / User-agent: Microsoft.URL.Control Disallow: / User-agent: Xenu Disallow: / User-agent: larbin Disallow: / User-agent: libwww Disallow: / User-agent: ZyBORG Disallow: / User-agent: Download Ninja Disallow: / User-agent: wget Disallow: / User-agent: grub-client Disallow: / User-agent: k2spider Disallow: / User-agent: NPBot Disallow: / User-agent: WebReaper Disallow: / User-agent: grapeshot Disallow: User-agent: MJ12bot Disallow: / User-agent: DotBot Disallow: / User-agent: dotbot Disallow: / User-agent: AhrefsBot Disallow: / User-agent: Squidbot Disallow: / User-agent: Mozilla/5.0 (compatible; AhrefsBot/6.1; +http://ahrefs.com/robot/) Disallow: / User-agent: SemrushBot Disallow: / User-agent: GPTBot Disallow: / User-agent: ChatGPT-User Disallow: / User-agent: anthropic-ai Disallow: / User-agent: Amazonbot Disallow: / User-agent: Applebot-Extended Disallow: / User-agent: ClaudeBot Disallow: / User-agent: GPTBot Disallow: / User-agent: PetalBot Disallow: / User-agent: uptimerobot Disallow: / User-agent: viberbot Disallow: / User-agent: YaK Disallow: / User-agent: Yandex Disallow: / User-agent: GnowitNewsbot Disallow: / User-agent: MoodleBot Disallow: / user-agent: Pinterestbot Disallow: / User-agent: CriteoBot/0.1 Disallow: User-agent: EvincedBot Disallow: / Crawl-delay: 3 # En condiciones normales este es el sitemap Sitemap: https://quo.eldiario.es/sitemap.xml Sitemap: https://quo.eldiario.es/news-sitemap.xml # Si utilizas Yoast SEO estos son los sitemaps principales Sitemap: https://quo.eldiario.es/sitemap_index.xml Sitemap: https://quo.eldiario.es/category-sitemap.xml Sitemap: https://quo.eldiario.es/page-sitemap.xml Sitemap: https://quo.eldiario.es/post-sitemap.xml