HTTPS surface reachable (robots ✓, sitemap ✗, title ✓)
Why it matters: Public files — robots.txt, sitemap.xml, head meta — are what attackers see first during reconnaissance. Misadvertised paths, stale sitemaps, and verbose generators leak more than intended (ISO 27001 A.8.9).
robots.txt
present
User-agent: AASA-Bot
User-agent: ADmantX
User-agent: AdsBot-Google
User-agent: AdsBot-Google-Mobile
User-agent: AmazonAdBot
User-agent: Amzn-User
User-agent: Applebot
User-agent: AppleNewsBot
User-agent: Bingbot
User-agent: BingPreview
User-agent: BrightEdge
User-agent: BrightEdgeOnCrawl
User-agent: CensysInspect
User-agent: ChatGPT-User
User-agent: Cision
User-agent: Clickagy
User-agent: Concert
User-agent: ContextualBot
User-agent: CriteoBot
User-agent: DatadogSynthetics
User-agent: datadome-pageprotect-scanner
User-agent: Discordbot
User-agent: doubleverify
User-agent: DuckDuckBot
User-agent: ElevenlabsBot
User-agent: Embedly
User-agent: facebookexternalhit
User-agent: Googlebot
User-agent: Googlebot Smartphone
User-agent: Googlebot-News
User-agent: Google-Display-Ads-Bot
User-agent: Google-InspectionTool
User-agent: GoogleOther
User-agent: Google-Read-Aloud
User-agent: Google-Safety
User-agent: Google-Site-Verification
User-agent: GTmetrix
User-agent: GumGumBot
User-agent: ias_crawler
User-agent: Iframely
User-agent: leiki
User-agent: LinkedInBot
User-agent: LinkTiger
User-agent: Mantisbot
User-agent: Mediapartners-Google
User-agent: meta-externalads
User-agent: meta-webindexer
User-agent: MicrosoftPreview
User-agent: MJ12bot
User-agent: Moreover
User-agent: msnbot
User-agent: NFBNewslineRobot
User-agent: OAI-SearchBot
User-agent: Oncrawl
User-agent: Opebot-v
User-agent: Opoint
User-agent: Optimizer
User-agent: outbrain
User-agent: Pinterestbot
User-agent: Prerender
User-agent: Proofpoint
User-agent: proximic
User-agent: PubMatic Crawler Bot
User-agent: Quantcastbot
User-agent: Qwantbot
User-agent: Reuters SEO Screaming Frog Spider 007
User-agent: Reuters-NAUWI
User-agent: Scom-Crawler-For-Reuters
User-agent: SinceraSyntheticUser
User-agent: Slurp
User-agent: SmartologyBot
User-agent: snews
User-agent: SocialFlow
User-agent: StatusCake
User-agent: Storebot-Google
User-agent: Stripebot
User-agent: TTD-Content
User-agent: Twitterbot
User-agent: URLDefense
User-agent: Verity
User-agent: vuln_scan_by_trustedsite_com_halo_security
User-agent: WISEbot
User-agent: Xenu Link Sleuth
User-agent: Yahoo Link Preview
User-agent: Yahoo! JAPAN
User-agent: YahooMailProxy
Disallow: /finance/stocks/option
Disallow: /finance/stocks/financialHighlights
Disallow: /search
Disallow: /site-search/
Disallow: /beta
Disallow: /designtech
Disallow: /featured-optimize
Disallow: /energy-test
Disallow: /article/beta
Disallow: /sponsored/previewcampaign
Disallow: /sponsored/previewarticle
Disallow: /test/
Disallow: /news/archive/commentary
Disallow: /brandfeatures/venture-capital
Disallow: /assets/siteindex
Disallow: /article/api/
Disallow: /practical-law-the-journal/search/
Disallow: /pf/api/
Disallow: /fr/
Disallow: /it/
Disallow: /es/
Disallow: /pt/
Disallow: /de/
Disallow: /latam/
Disallow: /account/subscribe/payment/
# Block all other bots
User-agent: *
Disallow: /
SITEMAP: https://www.reuters.com/arc/outboundfeeds/sitemap-index/?outputType=xml
SITEMAP: https://www.reuters.com/arc/outboundfeeds/news-sitemap-index/?outputType=xml
SITEMAP: https://www.reuters.com/plus/sitemap-index.xml
SITEMAP: https://www.reuters.com/arc/outboundfeeds/sitemap-plj-index/?outputType=xml
SITEMAP: https://www.reuters.com/graphics/sitemap.xml
SITEMAP: https://www.reuters.com/arc/outboundfeeds/sitemap-index/pictures/?outputType=xml
SITEMAP: https://www.reuters.com/static/video-sitemap/us/sitemap_video_index.xml
SITEMAP: https://www.reuters.com/arc/outboundfeeds/topic-sitemap/?outputType=xml
SITEMAP: https://www.reuters.com/arc/outboundfeeds/author-sitemap/?outputType=xml
SITEMAP: https://www.reuters.com/arc/outboundfeeds/pressrelease-sitemap/?outputType=xml
head
- title
- reuters.com
- description
- —
social
no OpenGraph or Twitter meta tags found