# ===================================================== # linuxsecurity.com robots.txt # Optimized for large content sites # ===================================================== User-agent: * Allow: /*.js$ Allow: /*.css$ Allow: /*.png$ Allow: /*.jpg$ Allow: /*.gif$ Allow: /sitemapindex_xml.xml Allow: /sitemapyears_xml.xml Disallow: /administrator/ Disallow: /bin/ Disallow: /cli/ Disallow: /installation/ Disallow: /language/ Disallow: /layouts/ Disallow: /logs/ Disallow: /tmp/ Sitemap: https://linuxsecurity.com/sitemapindex_xml.xml Sitemap: https://linuxsecurity.com/sitemapyears_xml.xml # --- Googlebot --- User-agent: Googlebot Allow: / Crawl-delay: 1 # --- Bingbot --- User-agent: Bingbot Allow: / Crawl-delay: 2 # --- AhrefsBot --- User-agent: AhrefsBot Allow: / Disallow: /administrator/ Disallow: /tmp/ Crawl-delay: 2 # --- SemrushBot --- User-agent: SemrushBot Allow: / Disallow: /administrator/ Disallow: /tmp/ Crawl-delay: 3 # --- MJ12bot --- User-agent: MJ12bot Disallow: /administrator/ Disallow: /tmp/ Crawl-delay: 5 # --- DotBot --- User-agent: DotBot Allow: / Disallow: /administrator/ Disallow: /tmp/ Crawl-delay: 3 # --- YandexBot --- User-agent: YandexBot Allow: / Disallow: /administrator/ Crawl-delay: 4 # --- Sogou, Baiduspider, and others --- User-agent: Sogou web spider Disallow: / User-agent: Baiduspider Disallow: / # --- Generic fallback for unknown bots --- User-agent: * Crawl-delay: 10