# HabitusNet - habitus.net # Search engines and social platforms: welcome # AI training crawlers: blocked via WAF (see llms.txt for ecosystem access) User-agent: Googlebot Allow: / User-agent: Bingbot Allow: / User-agent: Twitterbot Allow: / User-agent: facebookexternalhit Allow: / User-agent: LinkedInBot Allow: / # AI ecosystem integration (not training/scraping) # These bots can access /.well-known/, /llms.txt, /api/ecosystem/ only # All other paths blocked via Cloudflare WAF rules User-agent: ClaudeBot Allow: /.well-known/ Allow: /llms.txt Allow: /llms-full.txt Allow: /api/ecosystem/ Disallow: / User-agent: GPTBot Allow: /.well-known/ Allow: /llms.txt Allow: /llms-full.txt Allow: /api/ecosystem/ Disallow: / User-agent: CCBot Disallow: / User-agent: Bytespider Disallow: / # Default: allow crawling User-agent: * Allow: / Disallow: /admin Disallow: /auth Disallow: /api/ Sitemap: https://habitus.net/sitemap.xml