# WPROST.pl - Poland and the world, history, politics, business, culture

> Markdown mirror of DialtoneApp's public top-site detail page for `wprost.pl`.

URL: https://dialtoneapp.com/top-sites/wprost.pl/index.md
Canonical HTML: https://dialtoneapp.com/top-sites/wprost.pl

## Summary

- Domain: `wprost.pl`
- Website: https://wprost.pl
- Description: ai readable | score 16 | purchase read only
- Label: ai_readable
- Payment surface: Not available
- Purchase boundary: read_only
- Control boundary: unknown
- Rank: 28028

## robots

~~~text
#disable at search level
User-agent: *
Disallow: /szukaj/
Disallow: /wyszukaj/

# Allowed search engines directives
User-agent: Mediapartners-Google
Disallow:

User-agent: Googlebot
Disallow:

User-agent: Googlebot-Image
Disallow:

User-agent: Googlebot-Mobile
Disallow:

User-agent: Googlebot-News
Disallow:

User-agent: Googlebot-Video
Disallow:

User-agent: Adsbot-Google
Disallow:

User-Agent: Googlebot_Nauxeo
Disallow:

User-agent: Twitterbot
Disallow:

User-agent: Applebot
Disallow:

User-agent: Ouestfrancebot
Disallow:

User-agent: Taboolabot
Disallow:

User-agent: Proximic
Disallow:

User-agent: upday
Disallow:

User-agent: Bingbot
Disallow:

# Crawlers that are kind enough to obey, but which we'd rather not have
# unless they're feeding search engines.
User-agent: UbiCrawler
Disallow: /

User-agent: DOC
Disallow: /

User-agent: Zao
Disallow: /

# Some bots are known to be trouble, particularly those designed to copy
# entire sites. Please obey robots.txt.
User-agent: sitecheck.internetseer.com
Disallow: /

User-agent: Zealbot
Disallow: /

User-agent: MSIECrawler
Disallow: /

User-agent: SiteSnagger
Disallow: /

User-agent: WebStripper
Disallow: /

User-agent: WebCopier
Disallow: /

User-agent: Fetch
Disallow: /

User-agent: Offline Explorer
Disallow: /

User-agent: Teleport
Disallow: /

User-agent: TeleportPro
Disallow: /

User-agent: WebZIP
Disallow: /

User-agent: linko
Disallow: /

User-agent: HTTrack
Disallow: /

User-agent: Microsoft.URL.Control
Disallow: /

User-agent: Xenu
Disallow: /

User-agent: larbin
Disallow: /

User-agent: libwww
Disallow: /

User-agent: ZyBORG
Disallow: /

User-agent: Download Ninja
Disallow: /

# Misbehaving: requests much too fast:
User-agent: fast
Disallow: /

#
# Sorry, wget in its recursive mode is a frequent problem.
# Please read the man page and use it properly; there is a
# --wait option you can use to set the delay between hits,
# for instance.
#
User-agent: wget
Disallow: /

#
# The 'grub' distributed client has been *very* poorly behaved.
#
User-agent: grub-client
Disallow: /

#
# Doesn't follow robots.txt anyway, but...
#
User-agent: k2spider
Disallow: /

#
# Hits many times per second, not acceptable
# http://www.nameprotect.com/botinfo.html
User-agent: NPBot
Disallow: /

# A capture bot, downloads gazillions of pages with no public benefit
# http://www.webreaper.net/
User-agent: WebReaper
Disallow: /


User-agent: *
Crawl-delay: 2

User-agent: *
Disallow:
~~~
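The rules above can be spot-checked with Python's standard `urllib.robotparser`. The sketch below is a minimal illustration, not the method DialtoneApp uses: it parses a short excerpt copied from the file rather than fetching it, and the agent name `ExampleBot` and the article path `/polityka/` are hypothetical values chosen for the example.

~~~python
from urllib.robotparser import RobotFileParser

# Excerpt copied from the robots.txt capture above (not the full file).
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /szukaj/
Disallow: /wyszukaj/

User-agent: Googlebot
Disallow:

User-agent: HTTrack
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_EXCERPT)

# "ExampleBot" is a hypothetical agent; it falls under the "*" group.
print(parser.can_fetch("ExampleBot", "https://wprost.pl/szukaj/test"))  # False
print(parser.can_fetch("ExampleBot", "https://wprost.pl/polityka/"))    # True

# An empty Disallow line means nothing is blocked for that agent.
print(parser.can_fetch("Googlebot", "https://wprost.pl/szukaj/test"))   # True

# Site-copying tools such as HTTrack are blocked everywhere.
print(parser.can_fetch("HTTrack", "https://wprost.pl/"))                # False
~~~

Note that the full file declares `User-agent: *` three times. Parsers differ in how they merge repeated groups, so a result for the complete file (including the `Crawl-delay: 2` line) may depend on the library used.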

## llms

~~~text
User-Agent: *
Train: disallow
Query: disallow
Crawl: disallow
Parse: disallow
Generate: disallow
~~~
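The capture above is a list of `Key: value` lines rather than a Markdown document. As a rough sketch, assuming each line is an independent directive and that `disallow` means the action is refused (the page does not define these semantics), the lines can be read into a dictionary. The helper names `parse_directives` and `is_disallowed` are hypothetical.

~~~python
# Text copied from the llms capture above.
LLMS_TEXT = """\
User-Agent: *
Train: disallow
Query: disallow
Crawl: disallow
Parse: disallow
Generate: disallow
"""


def parse_directives(text: str) -> dict[str, str]:
    """Read 'Key: value' lines into a dict with lowercased keys and values."""
    directives: dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        key, sep, value = line.partition(":")
        if not sep:
            continue  # skip lines without a colon separator
        directives[key.strip().lower()] = value.strip().lower()
    return directives


def is_disallowed(directives: dict[str, str], action: str) -> bool:
    """Return True when the action is explicitly marked 'disallow' (assumed meaning)."""
    return directives.get(action.lower()) == "disallow"


directives = parse_directives(LLMS_TEXT)
print(directives["user-agent"])             # *
print(is_disallowed(directives, "Train"))   # True
print(is_disallowed(directives, "Crawl"))   # True
~~~

An action that is absent from the file returns `False` here, which only means it is not explicitly disallowed; whether it is permitted is left open.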

## llms-full

Not found.