# ========================= # Theneo — llms.txt (AI usage preferences) # This file declares Theneo’s preferences for AI/LLM crawlers. # It complements robots.txt; robots rules still apply. # ========================= Site: https://www.theneo.io Owner: Theneo Contact: legal@theneo.io Sitemap: https://www.theneo.io/sitemap.xml Canonical: https://www.theneo.io # ---- Data-use policy for public pages ---- # Allowed: crawl, index, cache, and use PUBLIC content for answer generation and model training. # Not allowed: collect or store non-public, gated, or personal data; bypass authentication; reproduce full pages. Policy: public-content-allowed; non-public-prohibited; attribution-preferred # ---- Paths (mirror robots exclusions) ---- Allow: / Disallow: /search Disallow: /404 Disallow: /401 Disallow: /admin Disallow: /editor Disallow: /*?*edit Disallow: /*?*preview Disallow: /*?*nocache Disallow: /*?*utm_* Disallow: /*?*ref=* Disallow: /*?*fbclid=* Disallow: /*?*gclid=* Disallow: /*?*msclkid=* Disallow: /*?*_hsenc=* Disallow: /*?*_hsmi=* # ---- AI/LLM crawlers explicitly opted in ---- User-agent: GPTBot Allow: / User-agent: CCBot Allow: / User-agent: ClaudeBot Allow: / User-agent: anthropic-ai Allow: / User-agent: PerplexityBot Allow: / # Google-Extended governs some generative uses by Google. User-agent: Google-Extended Allow: / # ---- Crawl etiquette (non-binding hints) ---- Crawl-delay: 2 Fetch-Window: 06:00-22:00 UTC # ---- Attribution preferences (non-binding) ---- Attribution: required Attribution-Name: Theneo Attribution-URL: https://www.theneo.io/ # ---- Legal references ---- Terms: https://www.theneo.io/terms Privacy: https://www.theneo.io/privacy