mirror of
https://github.com/ai-robots-txt/ai.robots.txt.git
synced 2025-04-12 05:57:45 +00:00
Update table-of-bot-metrics.md
parent c1e6265ef4
commit d2cd37442c
1 changed file with 1 addition and 0 deletions
@@ -1,5 +1,6 @@
| Name | Operator | Respects `robots.txt` | Data use | Visit regularity | Description |
|-----|----------|-----------------------|----------|------------------|-------------|
| AI | [Various](https://postopen.org/content-protection-project/) | Yes | Content is used to train artificial intelligence (AI), large language models (LLM), machine learning systems or neural networks. | No information provided. | PostOpen recommendation. |
| AI2Bot | [Ai2](https://allenai.org/crawler) | Yes | Content is used to train open language models. | No information provided. | Explores 'certain domains' to find web content. |
| Ai2Bot-Dolma | [Ai2](https://allenai.org/crawler) | Yes | Content is used to train open language models. | No information provided. | Explores 'certain domains' to find web content. |
| Amazonbot | Amazon | Yes | Service improvement and enabling answers for Alexa users. | No information provided. | Includes references to crawled website when surfacing answers via Alexa; does not clearly outline other uses. |