Commit 68d48c8e authored by Jan-Philipp Igla

add README and add robots.txt

README.md 0 → 100644
# ARD Reservation of Usage Rights
**As of: 13.05.2025**
- [ ] Add the following block (please do not modify it!) to your **robots.txt** (e.g. https://www.domain.de/robots.txt)
- [ ] After publishing, check that your robots.txt contains the latest changes
- [ ] Use the tool https://technicalseo.com/tools/robots-txt/ to validate the new robots.txt
**After updating your robots.txt for the first time**: Please send a confirmation, including the URL (https://www.domain.de/robots.txt), to philipp.igla@hr.de
If you have questions or anything is unclear, contact philipp.igla@hr.de (Modul SEO ARD) at any time.
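
The "do not modify" and "check after publishing" steps can be automated. The following is a minimal sketch, not part of the original repository: `contains_block`, `normalize`, and `fetch` are hypothetical helper names, and `REQUIRED_BLOCK` is only a shortened excerpt standing in for the full block from this README. It checks that the required lines appear contiguously and unchanged in a published robots.txt, ignoring line-ending and trailing-whitespace differences.

```python
import urllib.request

# Shortened excerpt for illustration only; in practice, paste the full
# block from this README here unchanged.
REQUIRED_BLOCK = """\
# OpenAI
User-agent: GPTBot
Disallow: /
"""


def normalize(text: str) -> list[str]:
    """Split into lines, unifying CRLF/LF and dropping trailing whitespace."""
    return [line.rstrip() for line in text.replace("\r\n", "\n").split("\n")]


def contains_block(robots_txt: str, block: str) -> bool:
    """Return True if all lines of `block` appear contiguously in `robots_txt`."""
    haystack = normalize(robots_txt)
    needle = normalize(block.strip("\n"))
    n = len(needle)
    return any(haystack[i:i + n] == needle for i in range(len(haystack) - n + 1))


def fetch(url: str, timeout: float = 10.0) -> str:
    """Download a robots.txt file; the URL is supplied by the caller."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")
```

A typical call would be `contains_block(fetch("https://www.domain.de/robots.txt"), REQUIRED_BLOCK)`, with the placeholder domain replaced by your own. A `False` result suggests the block is missing, outdated, or was altered on the way to production.
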
---
```txt
############
# As of: 13.05.2025
# List of crawlers (Modul SEO ARD)
# Amazon
User-agent: Amazonbot
Disallow: /
# Anthropic
User-agent: anthropic-ai
Disallow: /
User-agent: ClaudeBot
Disallow: /
User-agent: Claude-User
Disallow: /
User-agent: Claude-SearchBot
Disallow: /
User-agent: claude-web
Disallow: /
# Apple
User-agent: Applebot-Extended
Disallow: /
# ByteDance
User-agent: Bytespider
Disallow: /
User-agent: TikTokSpider
Disallow: /
# Cohere
User-agent: cohere-ai
Disallow: /
User-agent: cohere-training-data-crawler
Disallow: /
# Common Crawl
User-agent: CCBot
Disallow: /
# Diffbot
User-agent: DiffBot
Disallow: /
# DuckDuckGo
User-agent: DuckAssistBot
Disallow: /
# Google
User-agent: Google-Extended
Disallow: /
# Huawei
User-agent: PetalBot
Disallow: /
User-agent: PanguBot
Disallow: /
# Meta
User-agent: meta-externalagent
Disallow: /
User-agent: Meta-ExternalFetcher
Disallow: /
User-agent: FacebookBot
Disallow: /
# Mistral
User-agent: MistralAI-User
Disallow: /
# OpenAI
User-agent: ChatGPT-User
Disallow: /
User-agent: GPTBot
Disallow: /
User-agent: OAI-SearchBot
Disallow: /
User-agent: ChatGPT-User/2.0
Disallow: /
# Perplexity
User-agent: PerplexityBot
Disallow: /
User-agent: Perplexity-User
Disallow: /
# Webz.io
User-agent: omgili
Disallow: /
User-agent: omgilibot
Disallow: /
User-agent: Webzio-Extended
Disallow: /
# You.com
User-agent: YouBot
Disallow: /
# Zyte
User-agent: Scrapy
Disallow: /
############
```
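
As a complement to the external validation tool, the effect of the block can be spot-checked locally. The sketch below is an assumption-laden illustration rather than an official check: `blocked_agents` is a hypothetical helper, the agent names in the usage note are a small subset of the list above, and Python's standard-library `urllib.robotparser` may match user agents differently from how individual crawlers actually do.

```python
from urllib.robotparser import RobotFileParser


def blocked_agents(robots_txt: str, agents: list[str],
                   url: str = "https://www.domain.de/") -> dict[str, bool]:
    """Map each user agent to True if `robots_txt` forbids it from fetching `url`.

    The default URL is the placeholder domain used in this README.
    """
    parser = RobotFileParser()
    # parse() takes an iterable of lines and needs no network access.
    parser.parse(robots_txt.splitlines())
    return {agent: not parser.can_fetch(agent, url) for agent in agents}
```

For example, `blocked_agents(text, ["GPTBot", "ClaudeBot", "CCBot"])` should report `True` for each agent once the block is in place, while a regular search crawler that is not listed should still be reported as `False`.
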
robots.txt
############
# As of: 13.05.2025
# List of crawlers (Modul SEO ARD)
# Amazon
User-agent: Amazonbot
Disallow: /
# Anthropic
User-agent: anthropic-ai
Disallow: /
User-agent: ClaudeBot
Disallow: /
User-agent: Claude-User
Disallow: /
User-agent: Claude-SearchBot
Disallow: /
User-agent: claude-web
Disallow: /
# Apple
User-agent: Applebot-Extended
Disallow: /
# ByteDance
User-agent: Bytespider
Disallow: /
User-agent: TikTokSpider
Disallow: /
# Cohere
User-agent: cohere-ai
Disallow: /
User-agent: cohere-training-data-crawler
Disallow: /
# Common Crawl
User-agent: CCBot
Disallow: /
# Diffbot
User-agent: DiffBot
Disallow: /
# DuckDuckGo
User-agent: DuckAssistBot
Disallow: /
# Google
User-agent: Google-Extended
Disallow: /
# Huawei
User-agent: PetalBot
Disallow: /
User-agent: PanguBot
Disallow: /
# Meta
User-agent: meta-externalagent
Disallow: /
User-agent: Meta-ExternalFetcher
Disallow: /
User-agent: FacebookBot
Disallow: /
# Mistral
User-agent: MistralAI-User
Disallow: /
# OpenAI
User-agent: ChatGPT-User
Disallow: /
User-agent: GPTBot
Disallow: /
User-agent: OAI-SearchBot
Disallow: /
User-agent: ChatGPT-User/2.0
Disallow: /
# Perplexity
User-agent: PerplexityBot
Disallow: /
User-agent: Perplexity-User
Disallow: /
# Webz.io
User-agent: omgili
Disallow: /
User-agent: omgilibot
Disallow: /
User-agent: Webzio-Extended
Disallow: /
# You.com
User-agent: YouBot
Disallow: /
# Zyte
User-agent: Scrapy
Disallow: /
############
\ No newline at end of file