Merge pull request #6 from mattlinares/main

Adds table of bot details and metrics
This commit is contained in:
Cory Dransfeldt 2024-04-08 12:48:35 -07:00 committed by GitHub
commit 964b29d330
No known key found for this signature in database
GPG key ID: B5690EEEBB952194
2 changed files with 33 additions and 1 deletions

View file

@ -6,6 +6,8 @@ This is an open list of web crawlers associated with AI companies and the traini
A number of these crawlers have been sourced from [Dark Visitors](https://darkvisitors.com) and we appreciate the ongoing effort they put in to track these crawlers.
If you'd like to add information about a crawler to the list, please make a pull request with the bot name added to `robots.txt`, `ai.txt`, and any relevant details in `table-of-bot-metrics.md` to help people understand what's crawling.
---
## Additional resources

30
table-of-bot-metrics.md Normal file
View file

@ -0,0 +1,30 @@
|Name |Operator |Respects `robots.txt` |Data use |Visit regularity |Description |
|----------------|---------|-----------------------|----------|------------------|-------------|
|AdsBot-Google | | | | | |
|Amazonbot | | | | | |
|anthropic-ai | | | | | |
|Applebot | | | | | |
|AwarioRssBot | | | | | |
|AwarioSmartBot | | | | | |
|Bytespider | | | | | |
|CCBot | | | | | |
|ChatGPT-User | | | | | |
|ClaudeBot | | | | | |
|Claude-Web | | | | | |
|coher-ai | | | | | |
|DataForSeoBot | | | | | |
|FacebookBot | | | | | |
|Google-Extended| | | | | |
|GoogleOther | | | | | |
|GPTBot | | | | | |
|ImagesiftBot | | | | | |
|magpie-crawler | | | | | |
|Meltwater | | | | | |
|omgili | | | | | |
|omgilibot | | | | | |
|peer39_crawler| | | | | |
|peer39_crawler/1.0| | | | | |
|PerplexityBot | | | | | |
|PiplBot | | | | | |
|Seekr | | | | | |
|YouBot | | | | | |