HTTPS surface reachable (robots ✓, sitemap ✗, title ✓)
Why it matters: Public files — robots.txt, sitemap.xml, head meta — are what attackers see first during reconnaissance. Misadvertised paths, stale sitemaps, and verbose generators leak more than intended (ISO 27001 A.8.9).
robots.txt
present
# robots.txt updated 2026-04-01
User-agent: *
Disallow: /json/
Disallow: /cmlink/
Disallow: /premiumContent
Disallow: /*.pdf
Disallow: /js/adobe/at.js
Disallow: /redaktion?cache
Disallow: *mostclicked.gfn
Disallow: /;suche=artikelsuche/suche/
Disallow: /suche/
Disallow: *?sort=
Disallow: /dist/scripts/main_faz-comment.js
Disallow: /html-ajax*
Disallow: /livewire*
Disallow: *atarget.js
Disallow: */sport-ergebnisse/*/ma*/$
Disallow: */sport-ergebnisse/*/co*/$
Disallow: /api/collect/
Disallow: /podcast-service/
Disallow: */event/widgets/
Disallow: */event/*/live/
Disallow: */event/*/html/
Disallow: /source/
Disallow: /iq/
Disallow: /allesbeste/*?s=*
Disallow: /allesbeste/*&s=*
Disallow: */asset-comment-info/*
Disallow: /membership/*
Disallow: /kaufkompass/wp-admin/
Disallow: /kaufkompass/newsticker/
Disallow: /kaufkompass/wp-content/ab_api_cache/*
Disallow: /kaufkompass/search/
Disallow: /kaufkompass/*?p=*
Disallow: /kaufkompass/*&p=*
Disallow: /kaufkompass/*&preview=*
Allow: /kaufkompass/wp-admin/admin-ajax.php
User-agent: Meltwater
Disallow: /
User-agent: NewsNow
Disallow: /
User-agent: Bloodhound
Disallow: /
User-agent: cydralspider
Disallow: /
User-agent: downloadexpress
Disallow: /
User-agent: gammaSpider
Disallow: /
User-agent: ObjectsSearch
Disallow: /
User-agent: Pimptrain
Disallow: /
User-agent: Raven
Disallow: /
User-agent: wapspider
Disallow: /
User-agent: WebZinger
Disallow: /
User-agent: Fasterfox
Disallow: /
User-agent: sentibot
Disallow: /
User-agent: GPTBot
Disallow: /
Allow: /kaufkompass/
User-Agent: omgili
Disallow: /
User-Agent: omgilibot
Disallow: /
User-agent: CCBot
Disallow: /
User-agent: Google-Extended
Disallow: /
Allow: /source/
Allow: /*-accg-
User-agent: Bytespider
Disallow: /
User-agent: anthropic-ai
Disallow: /
User-agent: Diffbot
Disallow: /
User-agent: PerplexityBot
Disallow: /
User-agent: YouBot
Disallow: /
User-agent: Applebot-Extended
Disallow: /
User-agent: ClaudeBot
Disallow: /
User-agent: Claude-Web
Disallow: /
User-agent: Meta-ExternalAgent
Disallow: /
User-agent: DeepSeekBot
Disallow: /
User-agent: DeepSeek
Disallow: /
user-agent: GoogleOther
allow: /source/
User-Agent: Claude-SearchBot
Disallow: /
User-Agent: Claude-User
Disallow: /
User-Agent: cohere-ai
Disallow: /
User-Agent: cohere-training-data-crawler
Disallow: /
User-Agent: Amazonbot
Disallow: /
User-Agent: Timpibot
Disallow: /
User-Agent: AI2Bot
Disallow: /
User-Agent: DuckAssistBot
Disallow: /
User-Agent: Kangaroo Bot
Disallow: /
User-Agent: PanguBot
Disallow: /
User-Agent: MistralAI-User
Disallow: /
User-Agent: Devin
Disallow: /
User-Agent: PetalBot
Disallow: /
User-Agent: bigsur.ai
Disallow: /
User-Agent: CloudVertexBot
Disallow: /
User-Agent: LinerBot
Disallow: /
User-Agent: OAI-SearchBot
Disallow: /
Allow: /kaufkompass/
User-Agent: ChatGPT-User
Disallow: /
Allow: /kaufkompass/
# Legal notice: faz.net expressly reserves the right to use its content for commercial text and data mining (§44 b UrhG).
# The use of robots or other automated means to access faz.net or collect or mine data without the express permission of faz.net is strictly prohibited.
# faz.net may, in its discretion, permit certain automated access to certain faz.net pages.
# If you would like to apply for permission to crawl faz.net, collect or use data, please email nutzungsrechte@faz.de.
head
- title
- Aktuelle Nachrichten online - FAZ.NET
- description
- —
social
no OpenGraph or Twitter meta tags found