mirror of
https://github.com/ai-robots-txt/ai.robots.txt.git
synced 2025-04-05 19:37:45 +00:00
Merge pull request #6 from mattlinares/main
Adds table of bot details and metrics
This commit is contained in:
commit
964b29d330
2 changed files with 33 additions and 1 deletions
|
@ -4,7 +4,9 @@
|
||||||
|
|
||||||
This is an open list of web crawlers associated with AI companies and the training of LLMs to block. We encourage you to contribute to and implement this list on your own site.
|
This is an open list of web crawlers associated with AI companies and the training of LLMs to block. We encourage you to contribute to and implement this list on your own site.
|
||||||
|
|
||||||
A number of these crawlers have been sourced from [Dark Visitors](https://darkvisitors.com) and we appreciate the ongoing effort they put in to track these crawlers.
|
A number of these crawlers have been sourced from [Dark Visitors](https://darkvisitors.com) and we appreciate the ongoing effort they put in to track these crawlers.
|
||||||
|
|
||||||
|
If you'd like to add information about a crawler to the list, please make a pull request with the bot name added to `robots.txt`, `ai.txt`, and any relevant details in `table-of-bot-metrics.md` to help people understand what's crawling.
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
|
|
30
table-of-bot-metrics.md
Normal file
30
table-of-bot-metrics.md
Normal file
|
@ -0,0 +1,30 @@
|
||||||
|
|Name |Operator |Respects `robots.txt` |Data use |Visit regularity |Description |
|
||||||
|
|----------------|---------|-----------------------|----------|------------------|-------------|
|
||||||
|
|AdsBot-Google | | | | | |
|
||||||
|
|Amazonbot | | | | | |
|
||||||
|
|anthropic-ai | | | | | |
|
||||||
|
|Applebot | | | | | |
|
||||||
|
|AwarioRssBot | | | | | |
|
||||||
|
|AwarioSmartBot | | | | | |
|
||||||
|
|Bytespider | | | | | |
|
||||||
|
|CCBot | | | | | |
|
||||||
|
|ChatGPT-User | | | | | |
|
||||||
|
|ClaudeBot | | | | | |
|
||||||
|
|Claude-Web | | | | | |
|
||||||
|
|coher-ai | | | | | |
|
||||||
|
|DataForSeoBot | | | | | |
|
||||||
|
|FacebookBot | | | | | |
|
||||||
|
|Google-Extended| | | | | |
|
||||||
|
|GoogleOther | | | | | |
|
||||||
|
|GPTBot | | | | | |
|
||||||
|
|ImagesiftBot | | | | | |
|
||||||
|
|magpie-crawler | | | | | |
|
||||||
|
|Meltwater | | | | | |
|
||||||
|
|omgili | | | | | |
|
||||||
|
|omgilibot | | | | | |
|
||||||
|
|peer39_crawler| | | | | |
|
||||||
|
|peer39_crawler/1.0| | | | | |
|
||||||
|
|PerplexityBot | | | | | |
|
||||||
|
|PiplBot | | | | | |
|
||||||
|
|Seekr | | | | | |
|
||||||
|
|YouBot | | | | | |
|
Loading…
Reference in a new issue