mirror of
https://github.com/ai-robots-txt/ai.robots.txt.git
synced 2025-04-05 03:17:46 +00:00

Used by Google to crawl for internal research and development. It is unknown exactly what this entails, but it is a generic user agent used when no other appropriate user agent is available. Documentation available from Google: https://developers.google.com/search/docs/crawling-indexing/overview-google-crawlers
26 lines
608 B
Text
User-agent: AdsBot-Google
User-agent: Amazonbot
User-agent: anthropic-ai
User-agent: Applebot
User-agent: AwarioRssBot
User-agent: AwarioSmartBot
User-agent: Bytespider
User-agent: CCBot
User-agent: ChatGPT-User
User-agent: ClaudeBot
User-agent: Claude-Web
User-agent: cohere-ai
User-agent: DataForSeoBot
User-agent: FacebookBot
User-agent: Google-Extended
User-agent: GoogleOther
User-agent: GPTBot
User-agent: ImagesiftBot
User-agent: magpie-crawler
User-agent: omgili
User-agent: omgilibot
User-agent: peer39_crawler
User-agent: peer39_crawler/1.0
User-agent: PerplexityBot
User-agent: YouBot
Disallow: /
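
The file groups every listed user agent under a single `Disallow: /` rule, so each of them is asked to stay off the entire site while unlisted agents are unaffected. A minimal sketch of how that could be checked with Python's standard `urllib.robotparser` follows. The `ROBOTS_TXT` sample is abbreviated to four of the agents above, and the `is_blocked` helper, the `SomeBrowser` agent name, and the `example.com` URL are assumptions made for illustration; they are not part of this repository.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated copy of the file above: a few of its User-agent lines
# sharing the single Disallow rule (illustrative subset only).
ROBOTS_TXT = """\
User-agent: CCBot
User-agent: ClaudeBot
User-agent: GPTBot
User-agent: PerplexityBot
Disallow: /
"""


def is_blocked(user_agent: str, url: str = "https://example.com/page") -> bool:
    """Return True if the rules above disallow `user_agent` from fetching `url`.

    Hypothetical helper: the name and the default URL are placeholders.
    """
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network access is needed.
    parser.parse(ROBOTS_TXT.splitlines())
    return not parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "ClaudeBot", "SomeBrowser"):
        print(agent, "blocked" if is_blocked(agent) else "allowed")
```

robots.txt is advisory: this check only reports what a compliant crawler is asked to do, and a crawler that ignores the file has to be blocked at the server instead.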