A list of AI agents and robots to block.
Find a file
2024-04-01 09:55:46 -07:00
LICENSE Initial commit 2024-03-27 10:48:29 -07:00
noai-logo.png Reduce logo size 2024-03-28 09:09:29 +00:00
README.md chore: additional resources 2024-04-01 09:55:46 -07:00
robots.txt Add GoogleOther 2024-03-28 10:00:58 -05:00

AI robots.txt

This is an open list of web crawlers associated with AI companies and the training of LLMs to block. We encourage you to contribute to and implement this list on your own site.

A number of these crawlers have been sourced from Dark Visitors and we appreciate the ongoing effort they put in to track these crawlers.


Additional resources

Spawning.ai
Create an ai.txt: an additional avenue to block crawlers. Example file:

# Spawning AI
# Prevent datasets from using the following file types

User-Agent: *
Disallow: /
Disallow: *

Have I Been Trained?
Search datasets for your content and request its removal.


Thank you to Glyn for pushing me to set this up after I posted about blocking these crawlers.