The Food Institute - Your Source for Food Industry News, Data, & Trends

Machine Readiness

Stored receipt and evidence

Overall: 20
Readable: 65
Callable: 0
Commerce: 0
Payment: 0

Machine Access

Inspect the site's MCP endpoint

DialtoneApp can scan the stored discovery files for this domain, try the MCP initialize handshake, and show the raw protocol transcript.
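
How DialtoneApp performs this probe is not documented on this page. As a rough sketch of what an MCP initialize handshake generally looks like, the snippet below builds the two JSON-RPC 2.0 messages a client sends: the `initialize` request and the follow-up `notifications/initialized` notification. The protocol revision string and the client name `example-probe` are assumptions chosen for illustration, and no network call is made because this report does not list an endpoint URL.

```python
import json

# Assumption: a protocol revision the client claims to support.
# A real server may answer with a different revision during negotiation.
PROTOCOL_VERSION = "2025-03-26"


def build_initialize_request(request_id: int = 1) -> dict:
    """Build the JSON-RPC 2.0 'initialize' request that opens an MCP session."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "initialize",
        "params": {
            "protocolVersion": PROTOCOL_VERSION,
            "capabilities": {},  # this client advertises no optional features
            # Hypothetical client identity, for illustration only.
            "clientInfo": {"name": "example-probe", "version": "0.1.0"},
        },
    }


def build_initialized_notification() -> dict:
    """Build the notification sent after the server answers 'initialize'.

    Notifications carry no 'id', so the server sends no response.
    """
    return {"jsonrpc": "2.0", "method": "notifications/initialized"}


# Serialized bodies, ready to send over whichever transport the server exposes.
request_body = json.dumps(build_initialize_request())
notification_body = json.dumps(build_initialized_notification())
```

A raw protocol transcript of the kind mentioned above would then consist of these bodies plus whatever the server returns, if it answers at all.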

Purchase boundary: read only
Control boundary: unknown
Payment rails: None
Payment providers: None
Payment methods: None
Payment protocols: None
Payment assets: None
Payment networks: None
Capabilities: None
Verified payment surface: No
Crypto only: No
Readable docs: robots, llms
Products: 0
Variants: 0
Priced variants: 0
Currencies: 0
Offers: 0
Priced offers: 0
Priced actions: 0

Samples

Offer samples: No stored offer samples.
Action samples: No stored action samples.
Product samples: No stored product samples.

Document

robots.txt

# Food Institute - Robots.txt
# Updated: 2026-01-11

# Allow major search engines (Google, Bing) full access
User-agent: Googlebot
Allow: /

User-agent: Bingbot
Allow: /

# AI Training Bots - Follow same rules as llms.txt
User-agent: GPTBot
User-agent: ChatGPT-User
User-agent: OAI-SearchBot
User-agent: ClaudeBot
User-agent: Claude-User
User-agent: Claude-SearchBot
User-agent: Google-Extended
User-agent: GoogleOther
User-agent: Applebot-Extended
User-agent: Meta-ExternalAgent
User-agent: FacebookBot
User-agent: cohere-ai
User-agent: Diffbot
User-agent: anthropic-ai
Disallow: /wp-admin/
Disallow: /wp-includes/
Disallow: /wp-content/plugins/
Disallow: /wp-content/themes/
Disallow: /wp-content/uploads/ewww/
Disallow: /wp-content/uploads/ewww-3/
Disallow: /wp-json/
Disallow: /ewww/
Disallow: /ewww-3/
Disallow: /cgi-bin/
Disallow: /.well-known/
Disallow: /securefile/
Disallow: /.sucuriquarantine/
Disallow: /reports/2020/
Disallow: /reports/2021/
Disallow: /reports/2022/
Disallow: /reports/2023/
Disallow: /reports/eco/
Disallow: /reports/fir/
Disallow: /reports/du_pdf/
Disallow: /reports/join/
Disallow: /reports/mcith/
Disallow: /food1/
Disallow: /test-site/
Disallow: /feed/
Disallow: /trackback/
Disallow: /xmlrpc.php
Disallow: /*?
Disallow: /*.php$
Disallow: /*.js$
Disallow: /*.css$
Disallow: /comments/
Disallow: /author/
Disallow: /tag/
Disallow: /page/

# Commercial SEO Bots - Block entirely (also in .htaccess)
User-agent: AhrefsBot
User-agent: SemrushBot
User-agent: SERankingBot
User-agent: MJ12bot
User-agent: DotBot
User-agent: BLEXBot
User-agent: SEOkicks
User-agent: Barkrowler
User-agent: Netvibes
User-agent: Amazonbot
Disallow: /

# Known Rule-Breakers - Block entirely (also in .htaccess)
User-agent: PerplexityBot
User-agent: Perplexity-User
User-agent: Bytespider
User-agent: CCBot
User-agent: Omgilibot
User-agent: webzio-extended
User-agent: ImagesiftBot
Disallow: /

# Sitemap location (optional - WordPress generates this)
Sitemap: https://foodinstitute.com/sitemap.xml
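
To see how the grouped `User-agent` lines above resolve for individual crawlers, the sketch below feeds a shortened excerpt of this robots.txt to Python's standard-library `urllib.robotparser`. The excerpt keeps only a few representative rules, and the helper name `allowed` is hypothetical. Wildcard rules such as `/*?` and `/*.php$` are left out on purpose: the standard-library parser matches path prefixes and appears not to expand `*` or `$`, so results for those patterns would not reflect how wildcard-aware crawlers read them.

```python
from urllib.robotparser import RobotFileParser

# Shortened excerpt of the robots.txt above (not the full file).
ROBOTS_EXCERPT = """\
User-agent: Googlebot
Allow: /

User-agent: GPTBot
User-agent: ClaudeBot
Disallow: /wp-admin/
Disallow: /reports/2023/

User-agent: PerplexityBot
User-agent: CCBot
Disallow: /
"""

BASE = "https://foodinstitute.com"

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is needed.
parser.parse(ROBOTS_EXCERPT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if the excerpted rules let `agent` fetch `path`."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    for agent, path in [
        ("Googlebot", "/reports/2023/"),
        ("GPTBot", "/reports/2025/"),
        ("ClaudeBot", "/reports/2023/"),
        ("CCBot", "/"),
    ]:
        print(agent, path, allowed(agent, path))
```

Consecutive `User-agent` lines share the rule block that follows them, which is why GPTBot and ClaudeBot get identical answers here.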

Document

llms.txt

# Food Institute
# Leading food industry news and analysis platform
# https://foodinstitute.com

# Key Resources for AI Indexing
## [Latest News & Articles](https://foodinstitute.com/)
- Breaking food industry news, market analysis, and trends
## [Industry Reports](https://foodinstitute.com/reports/)
- Comprehensive market intelligence and research
## [Daily Updates](https://foodinstitute.com/reports/dailyupdate/)
- Daily food industry briefings and market insights
## [About Us](https://foodinstitute.com/about/)
- Learn about Food Institute's mission and services

# Company Information
- Company: Food Institute
- Focus: Food industry news, analysis, and market intelligence
- Content: Industry reports, daily briefings, market trends

# AI Crawling Policy - OPTIMIZED FOR BANDWIDTH

User-agent: *

# WordPress Core - BLOCK (No value for AI, wastes bandwidth)
Disallow: /wp-admin/              # WordPress admin
Disallow: /wp-includes/           # WordPress core files
Disallow: /wp-content/plugins/    # Plugin files
Disallow: /wp-content/themes/     # Theme files
Disallow: /wp-content/uploads/ewww/    # Image optimization cache
Disallow: /wp-content/uploads/ewww-3/  # Image optimization cache
Disallow: /wp-json/               # REST API endpoints

# Image Optimization Caches - BLOCK
Disallow: /ewww/                  # EWWW cache directory
Disallow: /ewww-3/                # EWWW cache directory (duplicate)

# Security & System - BLOCK
Disallow: /cgi-bin/               # CGI scripts
Disallow: /.well-known/           # SSL validation files
Disallow: /securefile/            # Security files
Disallow: /.sucuriquarantine/     # Sucuri quarantine

# Archive Optimization - BLOCK OLD CONTENT
Disallow: /reports/2020/          # Outdated reports (5+ years old)
Disallow: /reports/2021/          # Outdated reports (4+ years old)
Disallow: /reports/2022/          # Outdated reports (3+ years old)
Disallow: /reports/2023/          # Older reports (2+ years old)

# Test/Cache Directories - BLOCK
Disallow: /reports/eco/           # Old/test content
Disallow: /reports/fir/           # Old/test content
Disallow: /reports/du_pdf/        # PDF cache directory
Disallow: /reports/join/          # Join/signup forms
Disallow: /reports/mcith/         # MCITH content
Disallow: /test-site/		  # test site

# Subdomain - BLOCK (staging/test sites)
Disallow: /food1/                 # food1-co subdomain files
Disallow: /test-site/             # test site

# Functional Pages - BLOCK (No content value)
Disallow: /feed/                  # RSS feeds
Disallow: /trackback/             # Trackback endpoints
Disallow: /xmlrpc.php             # XML-RPC
Disallow: /*?                     # URLs with query parameters (search, etc)
Disallow: /*.php                  # PHP files
Disallow: /*.js                   # JavaScript files
Disallow: /*.css                  # Stylesheets
Disallow: /*.inc                  # Include files

# User-Generated/Dynamic Content - BLOCK
Disallow: /comments/              # Comment pages
Disallow: /author/                # Author archives
Disallow: /tag/                   # Tag pages
Disallow: /page/                  # Pagination

# Everything else is allowed (homepage, /about/, /reports/2024/, /reports/2025/, etc.)

# Training Guidelines
Training-Data: allowed
Commercial-Use: allowed-with-attribution
Attribution: required
Modification: allowed
Distribution: allowed-with-source-link
Data-Collection-Consent: implicit

# Explanation
We allow AI training on our current news articles and industry reports (2024-2025) to 
help spread food industry knowledge. Older archives and WordPress system files are 
excluded to reduce unnecessary bandwidth consumption.

# Metadata
Crawl-delay: 2
Categories: food-industry, news, market-intelligence, reports, analysis
Last-modified: 2026-01-11
Version: 1.0
Content-Focus: food-industry-news, market-analysis, daily-updates

# Custom Directives
Research: encouraged
Educational-use: encouraged
Commercial-training: allowed-with-attribution
User-generated-content: excluded
System-files: excluded
Archive-content: exclude-before-2024

# Preferred Indexing
Priority-content: /reports/dailyupdate/, /reports/2024/, /reports/2025/, /reports/marketinfo/
Update-frequency: daily
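
The llms.txt above mixes Markdown headings with `Key: value` directives such as `Training-Data: allowed` and `Crawl-delay: 2`. These field names look site-specific rather than standardized, so whether any crawler honors them is unknown. As a minimal sketch of how a tool might read them, the snippet below collects directive lines into a dictionary, dropping `#` comments and headings. The function name `parse_directives` and the shortened `SAMPLE` text are assumptions made for illustration.

```python
import re

# A directive is a hyphenated word, a colon, then a value.
DIRECTIVE = re.compile(r"^([A-Za-z][A-Za-z-]*):\s*(.*)$")


def parse_directives(text: str) -> dict[str, list[str]]:
    """Collect 'Key: value' lines; repeated keys accumulate in order."""
    directives: dict[str, list[str]] = {}
    for raw in text.splitlines():
        # Drop trailing '# ...' comments; heading lines become empty.
        line = raw.split("#", 1)[0].strip()
        match = DIRECTIVE.match(line)
        if match:
            key, value = match.groups()
            directives.setdefault(key.lower(), []).append(value.strip())
    return directives


# Shortened excerpt of the llms.txt above (not the full file).
SAMPLE = """\
# Training Guidelines
Training-Data: allowed
Commercial-Use: allowed-with-attribution
Attribution: required

# Archive Optimization - BLOCK OLD CONTENT
Disallow: /reports/2022/          # Outdated reports (3+ years old)
Disallow: /reports/2023/          # Older reports (2+ years old)

# Metadata
Crawl-delay: 2
"""

policy = parse_directives(SAMPLE)
```

Keys are lowercased and values are kept as lists because directives like `Disallow` repeat many times in the file.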

Document

llms-full.txt

Not stored for this site.