Machine Readiness
Stored receipt and evidence
20
65
0
0
0
Samples
No stored offer samples.
Samples
No stored action samples.
Samples
No stored product samples.
Document
# Set the crawl delay to 5 seconds - not all search engines will honour this User-agent: * Crawl-delay: 5 # Tell all user agents to ignore wp-admin User-agent: * Disallow: /wp-admin/ # Tell all user agents to ignore URLs with querystrings # IMPORTANT: Review this rule. It blocks all URLs with a '?' including # pagination, filters, and some search results that might be important to you. User-agent: * Disallow: /? # NEW: Exclude /staff/ pages and all sub-pages User-agent: * Disallow: /staff/ Disallow: /careers/ # ---------------------------------------------------- # BLOCK SPECIFIC BOTS (AI, Scrapers, Malicious, etc.) # IMPORTANT: Each bot must have its own "User-agent:" and "Disallow: /" lines. # This list is comprehensive. Consider if all of these need full site blocking. # ---------------------------------------------------- # AI Bots (originally commented out, now explicitly blocked if uncommented) # If you want to block these, uncomment each block below. # User-agent: AI2Bot # Disallow: / # User-agent: Ai2Bot-Dolma # Disallow: / # User-agent: aiHitBot # Disallow: / # User-agent: Amazonbot # Disallow: / # User-agent: Andibot # Disallow: / # User-agent: anthropic-ai # Disallow: / # User-agent: Applebot # Disallow: / # User-agent: Applebot-Extended # Disallow: / # User-agent: Brightbot 1.0 # Disallow: / # User-agent: Bytespider # Disallow: / # User-agent: CCBot # Disallow: / # User-agent: ChatGPT-User # Disallow: / # User-agent: Claude-SearchBot # Disallow: / # User-agent: Claude-User # Disallow: / # User-agent: Claude-Web # Disallow: / # User-agent: ClaudeBot # Disallow: / # User-agent: cohere-ai # Disallow: / # User-agent: cohere-training-data-crawler # Disallow: / # User-agent: Cotoyogi # Disallow: / # User-agent: Crawlspace # Disallow: / # User-agent: Diffbot # Disallow: / # User-agent: DuckAssistBot # Disallow: / # User-agent: FacebookBot # Disallow: / # User-agent: Factset_spyderbot # Disallow: / # User-agent: FirecrawlAgent # Disallow: / # User-agent: FriendlyCrawler # Disallow: / # User-agent: Google-CloudVertexBot # Disallow: / # User-agent: Google-Extended # Disallow: / # User-agent: GoogleOther # Disallow: / # User-agent: GoogleOther-Image # Disallow: / # User-agent: GoogleOther-Video # Disallow: / # User-agent: GPTBot # Disallow: / # User-agent: iaskspider/2.0 # Disallow: / # User-agent: ICC-Crawler # Disallow: / # User-agent: ImagesiftBot # Disallow: / # User-agent: img2dataset # Disallow: / # User-agent: ISSCyberRiskCrawler # Disallow: / # User-agent: Kangaroo Bot # Disallow: / # User-agent: meta-externalagent # Disallow: / # User-agent: Meta-ExternalAgent # Disallow: / # User-agent: meta-externalfetcher # Disallow: / # User-agent: Meta-ExternalFetcher # Disallow: / # User-agent: MistralAI-User/1.0 # Disallow: / # User-agent: NovaAct # Disallow: / # User-agent: OAI-SearchBot # Disallow: / # User-agent: omgili # Disallow: / # User-agent: omgilibot # Disallow: / # User-agent: Operator # Disallow: / # User-agent: PanguBot # Disallow: / # User-agent: Panscient # Disallow: / # User-agent: panscient.com # Disallow: / # User-agent: Perplexity-User # Disallow: / # User-agent: PerplexityBot # Disallow: / # User-agent: PetalBot # Disallow: / # User-agent: PhindBot # Disallow: / # User-agent: QualifiedBot # Disallow: / # User-agent: QuillBot # Disallow: / # User-agent: quillbot.com # Disallow: / # User-agent: SBIntuitionsBot # Disallow: / # User-agent: Scrapy # Disallow: / # User-agent: Sidetrade indexer bot # Disallow: / # User-agent: TikTokSpider # Disallow: / # User-agent: Timpibot # Disallow: / # User-agent: VelenPublicWebCrawler # Disallow: / # User-agent: Webzio-Extended # Disallow: / # User-agent: wpbot # Disallow: / # User-agent: YandexAdditional # Disallow: / # User-agent: YandexAdditionalBot # Disallow: / # User-agent: YouBot # Disallow: / # Other specific bots to block from the entire site User-agent: AITCSRoboti Disallow: / User-agent: Accoona Disallow: / User-agent: admantx Disallow: / User-agent: admantx-usaspb Disallow: / User-agent: adbeat_bot Disallow: / User-agent: aiHitBot Disallow: / User-agent: Amazonbot Disallow: / User-agent: Arachnophilia Disallow: / User-agent: AspiegelBot Disallow: / User-agent: AwarioSmartBot Disallow: / User-agent: BackDoorBot Disallow: / User-agent: BackRub Disallow: / User-agent: Baidu Disallow: / User-agent: BLEXbot Disallow: / User-agent: BLEXBot Disallow: / User-agent: BecomeBot Disallow: / User-agent: BlowFishi Disallow: / User-agent: BomboraBot Disallow: / User-agent: CatchBot Disallow: / User-agent: CCBot Disallow: / User-agent: CherryPicker Disallow: / User-agent: ClaudeBot Disallow: / User-agent: Clickagy Disallow: / User-agent: Cliqzbot Disallow: / User-agent: coccocbot Disallow: / User-agent: ConveraCrawler Disallow: / User-agent: contxbot Disallow: / User-agent: CrowdTanglebot Disallow: / User-agent: CyberSpyder Disallow: / User-agent: DotBot Disallow: / User-agent: EchoboxBot Disallow: / User-agent: EmailCollector Disallow: / User-agent: Exabot Disallow: / User-agent: Eyeotabot Disallow: / User-agent: findlinks Disallow: / User-agent: Foobot Disallow: / User-agent: Genieo Disallow: / User-agent: GetURL Disallow: / User-agent: Gigabot Disallow: / User-agent: GrapeshotCrawler Disallow: / User-agent: GumGum Disallow: / User-agent: HTTrack Disallow: / User-agent: Huaweisymantecspider Disallow: / User-agent: IAScrawler Disallow: / User-agent: ImagesiftBot Disallow: / User-agent: JikeSpider Disallow: / User-agent: Jobboerse Disallow: / User-agent: Java Disallow: / User-agent: Jyxobot Disallow: / User-agent: Leikibot Disallow: / User-agent: LinkScan Disallow: / User-agent: LinkisBot Disallow: / User-agent: linkdexbot Disallow: / User-agent: linkfluence.com Disallow: / User-agent: LivelapBot Disallow: / User-agent: Mail.RU_Bot Disallow: / User-agent: MauiBot Disallow: / User-agent: MAZBot Disallow: / User-agent: MBCrawler Disallow: / User-agent: MegaIndex.ru Disallow: / User-agent: MJ12bot Disallow: / User-agent: MojeekBot Disallow: / User-agent: mtbot/1.1.0i Disallow: / User-agent: NerdyBot Disallow: / User-agent: Nimbostratus-Bot Disallow: / User-agent: NTENTbot Disallow: / User-agent: Offline Explorer Disallow: / User-agent: Onespot-ScraperBot Disallow: / User-agent: Openbot Disallow: / User-agent: OutclicksBot Disallow: / User-agent: PaperLiBot Disallow: / User-agent: perl Disallow: / User-agent: PetalBot Disallow: / User-agent: PlurkBot Disallow: / User-agent: proximic Disallow: / User-agent: Proximi Disallow: / User-agent: python Disallow: / User-agent: Quantcastboti Disallow: / User-agent: Qwantify Disallow: / User-agent: ScholarBot Disallow: / User-agent: Scrap Disallow: / User-agent: Screaming Frog SEO Spider Disallow: / User-agent: Semantici Disallow: / User-agent: SentiBot Disallow: / User-agent: SEOkicks Disallow: / User-agent: SEOkicks-Robot Disallow: / User-agent: SerendeputyBot Disallow: / User-agent: serpstatbot Disallow: / User-agent: SeznamBot Disallow: / User-agent: SiteCheck-sitecrawl Disallow: / User-agent: SiteSnagger Disallow: / User-agent: Snooper Disallow: / User-agent: Sogou Disallow: / User-agent: Sosospider Disallow: / User-agent: SuperBot Disallow: / User-agent: Taboolabot Disallow: / User-agent: TeleportPro Disallow: / User-agent: TkBot Disallow: / User-agent: TTD-Content Disallow: / User-agent: TweetmemeBot Disallow: / User-agent: URLSpiderPro Disallow: / User-agent: Vagabondo Disallow: / User-agent: VelenPublicWebCrawler Disallow: / User-agent: VoilaBot Disallow: / User-agent: VoluumDSP-content-bot Disallow: / User-agent: WebCopier Disallow: / User-agent: weborama-fetcher Disallow: / User-agent: WebReaper Disallow: / User-agent: WebStripper Disallow: / User-agent: WebZIP Disallow: / User-agent: Xaldon_WebSpider Disallow: / User-agent: YaK Disallow: / User-agent: Yandex Disallow: / User-agent: YandexBot Disallow: / User-agent: YandexImages Disallow: / User-agent: ZGrab Disallow: / User-agent: ZoominfoBot Disallow: / User-agent: Scrapy Disallow: / User-agent: Buck Disallow: / User-agent: TinyTestBot Disallow: / User-agent: SEMrushBot Disallow: / User-agent: AhrefsBot Disallow: / User-agent: PetalBot Disallow: / User-agent: MJ12Bot Disallow: / User-agent: DotBot Disallow: / User-agent: MauiBot Disallow: / User-agent: YandexBot Disallow: / User-agent: Baiduspider Disallow: / User-agent: Barkrowler Disallow: / User-agent: Bytespider Disallow: / User-agent: WhatStuffWhereBot Disallow: / User-agent: Applebot Disallow: / User-agent: Sogou Pic Spider/3.0( http://www.sogou.com/docs/help/webmasters.htm#07) Disallow: / User-agent: Sogou head spider/3.0( http://www.sogou.com/docs/help/webmasters.htm#07) Disallow: / User-agent: Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07) Disallow: / User-agent: User-agent: Sogou Orion spider/3.0( http://www.sogou.com/docs/help/webmasters.htm#07) Disallow: / User-agent: Sogou-Test-Spider/4.0 (compatible; MSIE 5.5; Windows 98) Disallow: / User-agent: Mozilla/5.0 (compatible; Konqueror/3.5; Linux) KHTML/3.5.5 (like Gecko) (Exabot-Thumbnails) Disallow: / User-agent: Mozilla/5.0 (compatible; Exabot/3.0; +http://www.exabot.com/go/robot) Disallow: / User-agent: Swiftbot Disallow: / User-agent: Slurp Disallow: / User-agent: CCBot/2.0 (https://commoncrawl.org/faq/) Disallow: / User-agent: CCBot/2.0 Disallow: / User-agent: CCBot/2.0 (http://commoncrawl.org/faq/) Disallow: / # ---------------------------------------------------- # EXPLICITLY ALLOWED BOTS # These bots will override any Disallow rules that precede them for their specific user-agent. # An empty Disallow: line means all content is allowed for that user-agent. # ---------------------------------------------------- User-agent: Googlebot Disallow: User-agent: Bingbot Disallow: User-agent: DuckDuckBot Disallow: # --------------------------------- # END BOOKSWARM ROBOTS.TXT TEMPLATE # _ # [ ] # ( ) # |>| # __/===\__ # //| o=o |\\ # <] | o=o | [> # \=====/ # / / | \ \ # <_________>
Document
# Nosy Crow: Independent Children's Book Publisher > Nurture a lifelong love of reading with Nosy Crow, the multi\-award\-winning publisher of child\-focused, parent\-friendly books for ages 0\-12\. Generated by Yoast SEO v27.4, this is an llms.txt file, meant for consumption by LLMs. ## Pages - [Homepage](https://nosycrow.com/) - [Catalogues](https://nosycrow.com/catalogue/) - [Terms and Conditions: Nosy Crow Survey Giveaway](https://nosycrow.com/terms-and-conditions-nosy-crow-survey-giveaway/) - [T\&Cs for Jasmine Green Prize Draw](https://nosycrow.com/terms-and-conditions-for-jasmine-green-prize-draw/) - [Make crystals like Dorothy Hodgkin](https://nosycrow.com/make-crystals-like-dorothy-hodgkin/) ## Posts - [Grow your child's early reading skills with books about spring](https://nosycrow.com/blog/grow-your-childs-early-reading-skills-with-books-about-spring/) - [Nosy Crow Blog: A Spotlight on Poetry: Inside "This Is Not a Small Voice"](https://nosycrow.com/blog/nosy-crow-blog-a-spotlight-on-poetry-inside-this-is-not-a-small-voice/) - [Celebrate World Book Day 2026 with Unicorn Academy: My Secret Unicorn Diary ](https://nosycrow.com/blog/celebrate-world-book-day-2026/) - [Nosy Crow Presence on the platform, X\.](https://nosycrow.com/blog/nosy-crow-presence-on-the-platform-x/) - [The Nosy Crow Production Team is Highly Commended at The Bookseller 2025 FutureBook Awards\!](https://nosycrow.com/blog/the-nosy-crow-production-team-is-highly-commended-at-the-bookseller-2025-futurebook-awards/) ## Contributors - [Gosia Herba](https://nosycrow.com/contributor/gosia-herba/) - [Lizzie Lomax](https://nosycrow.com/contributor/lizzie-lomax/) - [Em Lynas](https://nosycrow.com/contributor/em-lynas/) - [Paige Braddock](https://nosycrow.com/contributor/paige-braddock/) - [Camilla Reid](https://nosycrow.com/contributor/camilla-reid/) ## Curated Lists - [Nosy Crow's Top Farm Books for Ages 0\-3 Years](https://nosycrow.com/curatedlist/nosy-crows-top-farm-books-for-ages-0-3-years/) - [Books about Spring for Kids to Explore Nature](https://nosycrow.com/curatedlist/ten-books-about-spring-for-kids-to-explore-nature/) - [Our Favourite Easter Books for Children](https://nosycrow.com/curatedlist/our-favourite-easter-books-for-children/) - [5 Books for Young Bird Spotters](https://nosycrow.com/curatedlist/five-of-the-best-books-about-birds-for-children/) - [25 Children’s Books with Free Downloadable Activity Sheets](https://nosycrow.com/curatedlist/childrens-books-and-activities/) ## Resources - [British Museum titles](https://nosycrow.com/resource/british-museum-titles/) - [Bizzy Bear Titles](https://nosycrow.com/resource/bizzy-bear-titles/) - [Two is a Crowd Teaching Resource](https://nosycrow.com/resource/two-is-a-crowd-teaching-resource/) - [Unicorn Academy Titles](https://nosycrow.com/resource/unicorn-academy-titles/) - [Pip and Posy TV tie\-in titles](https://nosycrow.com/resource/pip-and-posy-tv-tie-in-titles/) ## Events - [Chris Naylor\-Ballesteros at Warwick Books](https://nosycrow.com/event/chris-naylor-ballesteros-at-warwick-books/) - [Chris Naylor\-Ballesteros at Bag of Books](https://nosycrow.com/event/chris-naylor-ballesteros-at-bag-of-books/) - [Chris Naylor\-Ballesteros at The Book Nook](https://nosycrow.com/event/chris-naylor-ballesteros-at-the-book-nook/) - [Chris Naylor\-Ballesteros at Pickled Pepper Books](https://nosycrow.com/event/frank-and-bert-2-author-event/) - [Chris Naylor\-Ballesteros at Imagined Things Bookshop](https://nosycrow.com/event/frank-and-bert-2-author-event-2/) ## Staff - [Miranda Baker](https://nosycrow.com/staff/miranda-baker/) - [Kirstie Williams](https://nosycrow.com/staff/hr-and-office-manager/) - [Charlotte Graver](https://nosycrow.com/staff/charlotte-graver/) - [Jess Chinn](https://nosycrow.com/staff/jess-chin/) - [Sophie Wallman](https://nosycrow.com/staff/sophie-wallm/) ## Jobs - [Inventory Assistant \(Operations\)](https://nosycrow.com/job/inventory-assistant-operations/) ## BM Words - [Coin](https://nosycrow.com/bmword/coin-2/) - [Parrot](https://nosycrow.com/bmword/parrot/) - [Goldfish](https://nosycrow.com/bmword/goldfish-2/) - [Porcupine](https://nosycrow.com/bmword/porcupine/) - [Rectangle](https://nosycrow.com/bmword/rectangle/) ## Homepage slides - [We Are Dragon](https://nosycrow.com/blog/homepage-slide/we-are-dragon/) - [Books about Spring for Kids to Explore Nature](https://nosycrow.com/blog/homepage-slide/books-about-spring-for-kids-to-explore-nature/) ## Products - [Press Out and Colour: Butterflies](https://nosycrow.com/product/press-out-and-colour-butterflies/) - [Pip and Posy: The Christmas Snowman \(A TV tie\-in picture book\)](https://nosycrow.com/product/pip-and-posy-the-christmas-snowman-a-tv-tie-in-picture-book/) - [Press Out and Decorate: Narwhals and Mermaids](https://nosycrow.com/product/press-out-and-decorate-narwhals-and-mermaids/) - [Fairy Trails: Jack and the Beanstalk \(\#2\)](https://nosycrow.com/product/fairy-trails-jack-and-the-beanstalk/) - [Pip and Posy: Snow Coaches](https://nosycrow.com/product/pip-and-posy-snow-coaches/) ## 3D FlipBook - [2024 catalogue](https://nosycrow.com/blog/3d-flip-book/2024-catalogue/) ## Categories - [Nosy Crow Blogs](https://nosycrow.com/blog/category/nosy-crow-blogs/) - [Nest Press](https://nosycrow.com/blog/category/nest-press/) - [Chapter Extracts](https://nosycrow.com/blog/category/chapter-extracts/) - [Guest Posts](https://nosycrow.com/blog/category/guest-posts/) - [Recommended Reads](https://nosycrow.com/blog/category/recommended-reads/) ## Tags - [Nosy Crow](https://nosycrow.com/blog/tag/nosy-crow/) - [children's books](https://nosycrow.com/blog/tag/childrens-books/) - [preview](https://nosycrow.com/blog/tag/preview/) - [look inside](https://nosycrow.com/blog/tag/look-inside/) - [behind the scenes](https://nosycrow.com/blog/tag/behind-the-scenes/) ## Book formats - [Paperback](https://nosycrow.com/blog/book-format/paperback/) - [Board Book](https://nosycrow.com/blog/book-format/board-book/) - [Hardback](https://nosycrow.com/blog/book-format/hardback/) ## Interests - [animals](https://nosycrow.com/blog/interest/animals/) - [adventure](https://nosycrow.com/blog/interest/adventure/) - [funny](https://nosycrow.com/blog/interest/funny/) - [friendship](https://nosycrow.com/blog/interest/friendship/) - [nature](https://nosycrow.com/blog/interest/nature/) ## Product categories - [Books](https://nosycrow.com/product-category/books/) ## Product tags - [animals](https://nosycrow.com/product-tag/animals/) - [gift](https://nosycrow.com/product-tag/gift/) - [funny](https://nosycrow.com/product-tag/funny/) - [adventure](https://nosycrow.com/product-tag/adventure/) - [friendship](https://nosycrow.com/product-tag/friendship/) ## Resource Type - [Activity Sheet](https://nosycrow.com/blog/resourcetype/activity-sheet/) - [Teacher Resource](https://nosycrow.com/blog/resourcetype/teacher-resource/) ## Series - [Bizzy Bear](https://nosycrow.com/series/bizzy-bear/) - [Felt Flaps](https://nosycrow.com/series/felt-flaps/) - [Zoe's Rescue Zoo](https://nosycrow.com/series/zoes-rescue-zoo/) - [Pip and Posy](https://nosycrow.com/series/pip-and-posy/) - [Sing Along with Me\!](https://nosycrow.com/series/sing-along-with-me/) ## Partnerships - [National Trust](https://nosycrow.com/partnership/national-trust/) - [The British Museum](https://nosycrow.com/partnership/british-museum/) - [University of Cambridge](https://nosycrow.com/partnership/university-of-cambridge/) ## Category - [Numbers](https://nosycrow.com/wordcat/numbers/) - [Around the Home](https://nosycrow.com/wordcat/around-the-home/) - [Animals](https://nosycrow.com/wordcat/animals/) - [In the Garden](https://nosycrow.com/wordcat/in-the-garden/) - [Time to Eat](https://nosycrow.com/wordcat/time-to-eat/) ## Optional - [Sitemap index](https://nosycrow.com/sitemap_index.xml)
Document
Not stored for this site.