From d2cd37442c0943b64553df578f445fc39541e99c Mon Sep 17 00:00:00 2001 From: Michael Davey Date: Mon, 23 Sep 2024 22:26:05 +0100 Subject: [PATCH] Update table-of-bot-metrics.md --- table-of-bot-metrics.md | 1 + 1 file changed, 1 insertion(+) diff --git a/table-of-bot-metrics.md b/table-of-bot-metrics.md index d9441b5..b3c5ee2 100644 --- a/table-of-bot-metrics.md +++ b/table-of-bot-metrics.md @@ -1,5 +1,6 @@ | Name | Operator | Respects `robots.txt` | Data use | Visit regularity | Description | |-----|----------|-----------------------|----------|------------------|-------------| +| AI | [Various](https://postopen.org/content-protection-project/) | Yes | Content is used to train artificial intelligence (AI), large language models (LLM), machine learning systems or neural networks. | No information. provided. | PostOpen recommendation. | | AI2Bot | [Ai2](https://allenai.org/crawler) | Yes | Content is used to train open language models. | No information. provided. | Explores 'certain domains' to find web content. | | Ai2Bot-Dolma | [Ai2](https://allenai.org/crawler) | Yes | Content is used to train open language models. | No information. provided. | Explores 'certain domains' to find web content. | | Amazonbot | Amazon | Yes | Service improvement and enabling answers for Alexa users. | No information. provided. | Includes references to crawled website when surfacing answers via Alexa; does not clearly outline other uses. |