From b229f5b9366a0b9a77a4573589ed861de16db435 Mon Sep 17 00:00:00 2001
From: Glyn Normington <glyn@underlap.org>
Date: Thu, 17 Oct 2024 12:25:54 +0100
Subject: [PATCH 1/2] Re-order the FAQ

The "why" question should come first.
---
 FAQ.md | 20 ++++++++++----------
 1 file changed, 10 insertions(+), 10 deletions(-)

diff --git a/FAQ.md b/FAQ.md
index 49cbdfb..1b3f247 100644
--- a/FAQ.md
+++ b/FAQ.md
@@ -1,5 +1,15 @@
 # Frequently asked questions
 
+## Why should we block these crawlers?
+
+They're extractive, confer no benefit to the creators of data they're ingesting and also have wide-ranging negative externalities.
+
+**[How Tech Giants Cut Corners to Harvest Data for A.I.](https://www.nytimes.com/2024/04/06/technology/tech-giants-harvest-data-artificial-intelligence.html?unlocked_article_code=1.ik0.Ofja.L21c1wyW-0xj&ugrp=m)**
+> OpenAI, Google and Meta ignored corporate policies, altered their own rules and discussed skirting copyright law as they sought online information to train their newest artificial intelligence systems.
+
+**[How AI copyright lawsuits could make the whole industry go extinct](https://www.theverge.com/24062159/ai-copyright-fair-use-lawsuits-new-york-times-openai-chatgpt-decoder-podcast)**
+> The New York Times' lawsuit against OpenAI is part of a broader, industry-shaking copyright challenge that could define the future of AI.
+
 ## How do we know AI companies/bots respect `robots.txt`?
 
 The short answer is that we don't. `robots.txt` is a well-established standard, but compliance is voluntary. There is no enforcement mechanism.
@@ -36,16 +46,6 @@ That depends on your stack.
 - Vercel
   - [Block AI Bots Firewall Rule](https://vercel.com/templates/firewall/block-ai-bots-firewall-rule) by Vercel
 
-## Why should we block these crawlers?
-
-They're extractive, confer no benefit to the creators of data they're ingesting and also have wide-ranging negative externalities.
-
-**[How Tech Giants Cut Corners to Harvest Data for A.I.](https://www.nytimes.com/2024/04/06/technology/tech-giants-harvest-data-artificial-intelligence.html?unlocked_article_code=1.ik0.Ofja.L21c1wyW-0xj&ugrp=m)**
-> OpenAI, Google and Meta ignored corporate policies, altered their own rules and discussed skirting copyright law as they sought online information to train their newest artificial intelligence systems.
-
-**[How AI copyright lawsuits could make the whole industry go extinct](https://www.theverge.com/24062159/ai-copyright-fair-use-lawsuits-new-york-times-openai-chatgpt-decoder-podcast)**
-> The New York Times' lawsuit against OpenAI is part of a broader, industry-shaking copyright challenge that could define the future of AI.
-
 ## How can I contribute?
 
 Open a pull request. It will be reviewed and acted upon appropriately. **We really appreciate contributions** — this is a community effort.

From e6bb7cae9ead3e33078c3b9632a44b3234f241ba Mon Sep 17 00:00:00 2001
From: Glyn Normington <glyn@underlap.org>
Date: Thu, 17 Oct 2024 12:27:05 +0100
Subject: [PATCH 2/2] Augment the "why" FAQ

Ref: https://github.com/ai-robots-txt/ai.robots.txt/issues/40#issuecomment-2419078796
---
 FAQ.md | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/FAQ.md b/FAQ.md
index 1b3f247..4d58350 100644
--- a/FAQ.md
+++ b/FAQ.md
@@ -10,6 +10,8 @@ They're extractive, confer no benefit to the creators of data they're ingesting
 **[How AI copyright lawsuits could make the whole industry go extinct](https://www.theverge.com/24062159/ai-copyright-fair-use-lawsuits-new-york-times-openai-chatgpt-decoder-podcast)**
 > The New York Times' lawsuit against OpenAI is part of a broader, industry-shaking copyright challenge that could define the future of AI.
 
+Crawlers also sometimes impact the performance of crawled sites, or even take them down.
+
 ## How do we know AI companies/bots respect `robots.txt`?
 
 The short answer is that we don't. `robots.txt` is a well-established standard, but compliance is voluntary. There is no enforcement mechanism.