Merge pull request #31 from glyn/addfaq

Add FAQ
2025-05-20 01:03:11 +00:00 · 2024-08-06 11:15:29 -07:00 · 2024-08-06 11:15:29 -07:00 · 85275e55b8
commit 85275e55b8
parent 3e91a84d11 6d2285f5e0
1 changed files with 7 additions and 1 deletions
--- a/FAQ.md
+++ b/FAQ.md
@ -2,7 +2,13 @@

 ## How do we know AI companies/bots respect `robots.txt`?

-The short answer is that we don't. `robots.txt` is a well-established standard but compliance is voluntary. There is no enforcement mechanism.
+The short answer is that we don't. `robots.txt` is a well-established standard, but compliance is voluntary. There is no enforcement mechanism.
+
+## Why might AI web crawlers respect `robots.txt`?
+
+Larger and/or reputable companies developing AI models probably wouldn't want to damage their reputation by ignoring `robots.txt`.
+
+Also, given the contentious nature of AI and the possibility of legislation limiting its development, companies developing AI models will probably want to be seen to be behaving ethically, and so should (eventually) respect `robots.txt`.

 ## Can we block crawlers based on user agent strings?