# robots.txt for https://www.tudublin.ie/ # Terminalfour Crawler User-agent: terminalfour-nutch-spider Disallow: /study/modules/ Allow: /intranet/ # Global Search Engine Rules User-agent: * Disallow: /intranet/ Disallow: /study/modules/ Disallow: /explore/news/archive-2019/ Disallow: /explore/news/archive-2020/ Disallow: /explore/news/archive-2021/ Disallow: /explore/news/archive-2022/ Disallow: /explore/news/archive-2023/ # Sitemap Sitemap: https://www.tudublin.ie/sitemap-en.xml