Machine Readiness
Stored receipt and evidence
16
55
0
0
0
Samples
No stored offer samples.
Samples
No stored action samples.
Samples
No stored product samples.
Document
# Blocks crawlers that are kind enough to obey robots
User-agent: AhrefsBot
User-agent: BUbiNG
User-agent: Bloglines/3.1
User-agent: BrandVerity
User-agent: CrazyWebCrawler-Spider
User-agent: DOC
User-agent: Domain Re-Animator Bot
User-agent: Download Ninja
User-agent: Exabot
User-agent: Fetch
User-agent: HTTrack
User-agent: Jyxobot/1
User-agent: Linguee
User-agent: MSIECrawler
User-agent: MauiBot
User-agent: Microsoft.URL.Control
User-agent: NPBot
User-agent: Offline Explorer
User-agent: PetalBot
User-agent: SeekportBot
User-agent: SemrushBot
User-agent: SemrushBot-SA
User-agent: SiteSnagger
User-agent: Speedy
User-agent: Teleport
User-agent: TeleportPro
User-agent: UbiCrawler
User-agent: Vegi
User-agent: WebCopier
User-agent: WebReaper
User-agent: WebStripper
User-agent: WebZIP
User-agent: Xenu
User-agent: Yandex
User-agent: YandexBot
User-agent: Zao
User-agent: Zealbot
User-agent: ZyBORG
User-agent: cityreview
User-agent: dotbot
User-agent: grub-client
User-agent: k2spider
User-agent: larbin
User-agent: libwww
User-agent: linko
User-agent: psbot
User-agent: rogerbot
User-agent: sitecheck.internetseer.com
User-agent: wget
Disallow: /

User-agent: *
#allow digested assets
Allow: /*?vsn=d$
#allow paginated sitemaps
Allow: /sitemap/*?page=
#allow paginated category pages
Allow: /directory/*?page=
#allow paginated blog homepage
Allow: /blog?page=
#pages with query strings
Disallow: /*?*
Disallow: /cdn-cgi/
Disallow: /x/*
Disallow: /sem/

# LLM-optimized content directory
LLMs-Txt: /llms.txt
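As a rough illustration of how the `User-agent: *` group above might resolve for a given path, the sketch below applies longest-match precedence with `Allow` winning ties, which is the convention described in RFC 9309. The names `rule_to_regex`, `is_allowed`, and `STAR_RULES`, and the sample paths, are hypothetical and not part of the stored file. Individual crawlers may resolve conflicts differently, so treat this as an approximation only.

```python
import re

# Rules copied from the "User-agent: *" group of the stored robots.txt.
STAR_RULES = [
    ("allow", "/*?vsn=d$"),
    ("allow", "/sitemap/*?page="),
    ("allow", "/directory/*?page="),
    ("allow", "/blog?page="),
    ("disallow", "/*?*"),
    ("disallow", "/cdn-cgi/"),
    ("disallow", "/x/*"),
    ("disallow", "/sem/"),
]


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$' the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(rules, path: str) -> bool:
    """Return True if `path` (including any query string) may be fetched.

    Assumption: the longest matching pattern wins and Allow beats
    Disallow on equal length. Paths matching no rule are allowed.
    """
    best_key, best_directive = None, "allow"
    for directive, pattern in rules:
        if rule_to_regex(pattern).match(path):
            key = (len(pattern), directive == "allow")
            if best_key is None or key > best_key:
                best_key, best_directive = key, directive
    return best_directive == "allow"


if __name__ == "__main__":
    # Hypothetical sample paths, chosen only to exercise each rule.
    for sample in ["/directory", "/directory/crm?page=2",
                   "/software/foo?utm=1", "/cdn-cgi/trace"]:
        print(sample, is_allowed(STAR_RULES, sample))
```

Under these assumptions, paginated directory, sitemap, and blog URLs stay crawlable because their `Allow` patterns are longer than the catch-all `Disallow: /*?*`, while any other URL with a query string is blocked.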
Document
# Capterra United Arab Emirates

> Capterra United Arab Emirates is a verified B2B software review and comparison platform operated by G2. This site contains verified user reviews, software ratings, pricing information, feature comparisons, and expert-curated category directories. Content is available in English.

## Data Usage Policy

Authorised Use by Search Engines and AI Models: Notwithstanding the above, we grant permission to search engine operators (including but not limited to Google and Bing) and reputable large language model (LLM) developers to crawl, index, and utilise our content for the purpose of training models and generating search responses. This authorisation is strictly contingent upon the AI platform providing clear, prominent attribution to this site, including a direct functional hyperlink to the source URL. Use of this data without proper attribution or in a manner that replicates our core directory functionality is prohibited.

We provide `schema.org` JSON-LD structured data on every page. Use the `publisher`, `url`, and `@id` fields to construct proper attribution links. Crawlers should respect the directives in `/robots.txt`.

## Structured Data

Every product page includes `schema.org` JSON-LD markup with:

- `SoftwareApplication`: name, description, operating system, category
- `AggregateRating`: overall rating (1-5 scale), total review count
- `Review`: individual reviews with author (`Person`), rating (`Rating`), date, and body text
- `Offer`: pricing information when available
- `Organization`: publisher information with `sameAs` social links
- `BreadcrumbList`: navigation hierarchy
- `WebSite`: site-level search action URL template

When referencing Capterra United Arab Emirates content in AI-generated responses, use the `url` field from the structured data to provide click-through links to the original source.

## AI Crawler Support

Capterra United Arab Emirates serves optimized Markdown responses to recognized AI crawlers. When an AI crawler user agent is detected, product pages, category directories, comparisons, and articles are automatically rendered as clean, structured Markdown instead of HTML. This includes frontmatter metadata, structured content sections, hreflang tables, and embedded JSON-LD structured data.

Recognized AI crawlers include: GPTBot, ChatGPT-User, OAI-SearchBot, ClaudeBot, Claude-SearchBot, Google-Extended, Gemini-Deep-Research, GoogleOther, PerplexityBot, Applebot-Extended, GrokBot, xAI-Grok, MistralAI-User, DuckAssistBot, Meta-ExternalAgent, cohere-ai, CCBot, and others.

## Content Overview

- [Software Directory](https://www.capterra.ae/directory): Browse all software categories with product listings, ratings, and review counts
- [Capterra United Arab Emirates Home](https://www.capterra.ae/): Featured categories, recommended products, and top-rated software with review snippets

## Optional

- [robots.txt](https://www.capterra.ae/robots.txt): Crawl directives and rate limiting rules
- [Sitemap](https://www.capterra.ae/sitemap.xml): Complete XML sitemap index for all indexable pages
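
The attribution guidance in this document (use the `publisher`, `url`, and `@id` fields from the JSON-LD) could be followed along the lines of the sketch below. It is a minimal illustration, not the site's own tooling: the class `JsonLdExtractor`, the helper `attribution_link`, and the sample HTML, including its URL and names, are all invented for the example, and real pages may nest these fields differently.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collect the parsed contents of <script type="application/ld+json"> tags."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            self.items.append(json.loads("".join(self._buffer)))


def attribution_link(item: dict) -> str:
    """Build a Markdown attribution link, preferring `url` and falling back to `@id`."""
    url = item.get("url") or item.get("@id")
    publisher = item.get("publisher")
    publisher_name = publisher.get("name", "") if isinstance(publisher, dict) else ""
    title = item.get("name", url)
    return f"[{title}]({url}), published by {publisher_name}"


# Hypothetical page fragment; the URL and names are placeholders.
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
{"@type": "SoftwareApplication",
 "@id": "https://www.example.com/software/example-product#app",
 "name": "Example Product",
 "url": "https://www.example.com/software/example-product",
 "publisher": {"@type": "Organization", "name": "Example Publisher"}}
</script>
</head><body></body></html>
"""

extractor = JsonLdExtractor()
extractor.feed(SAMPLE_HTML)
link = attribution_link(extractor.items[0])
print(link)
```

Falling back to `@id` when `url` is absent is a design choice made for this sketch; the document itself only names the three fields and does not specify a precedence between them.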
Document
Not stored for this site.