Update table-of-bot-metrics.md

This commit is contained in:
Michael Davey 2024-09-23 22:26:05 +01:00 committed by GitHub
parent c1e6265ef4
commit d2cd37442c
No known key found for this signature in database
GPG key ID: B5690EEEBB952194

View file

@ -1,5 +1,6 @@
| Name | Operator | Respects `robots.txt` | Data use | Visit regularity | Description |
|-----|----------|-----------------------|----------|------------------|-------------|
| AI | [Various](https://postopen.org/content-protection-project/) | Yes | Content is used to train artificial intelligence (AI), large language models (LLM), machine learning systems or neural networks. | No information. provided. | PostOpen recommendation. |
| AI2Bot | [Ai2](https://allenai.org/crawler) | Yes | Content is used to train open language models. | No information. provided. | Explores 'certain domains' to find web content. |
| Ai2Bot-Dolma | [Ai2](https://allenai.org/crawler) | Yes | Content is used to train open language models. | No information. provided. | Explores 'certain domains' to find web content. |
| Amazonbot | Amazon | Yes | Service improvement and enabling answers for Alexa users. | No information. provided. | Includes references to crawled website when surfacing answers via Alexa; does not clearly outline other uses. |