mirror of
https://github.com/ai-robots-txt/ai.robots.txt.git
synced 2025-04-16 15:27:46 +00:00
Re-order the FAQ
The "why" question should come first.
This commit is contained in:
parent
b1491d2694
commit
b229f5b936
1 changed files with 10 additions and 10 deletions
20
FAQ.md
20
FAQ.md
|
@ -1,5 +1,15 @@
|
||||||
# Frequently asked questions
|
# Frequently asked questions
|
||||||
|
|
||||||
|
## Why should we block these crawlers?
|
||||||
|
|
||||||
|
They're extractive, confer no benefit to the creators of data they're ingesting and also have wide-ranging negative externalities.
|
||||||
|
|
||||||
|
**[How Tech Giants Cut Corners to Harvest Data for A.I.](https://www.nytimes.com/2024/04/06/technology/tech-giants-harvest-data-artificial-intelligence.html?unlocked_article_code=1.ik0.Ofja.L21c1wyW-0xj&ugrp=m)**
|
||||||
|
> OpenAI, Google and Meta ignored corporate policies, altered their own rules and discussed skirting copyright law as they sought online information to train their newest artificial intelligence systems.
|
||||||
|
|
||||||
|
**[How AI copyright lawsuits could make the whole industry go extinct](https://www.theverge.com/24062159/ai-copyright-fair-use-lawsuits-new-york-times-openai-chatgpt-decoder-podcast)**
|
||||||
|
> The New York Times' lawsuit against OpenAI is part of a broader, industry-shaking copyright challenge that could define the future of AI.
|
||||||
|
|
||||||
## How do we know AI companies/bots respect `robots.txt`?
|
## How do we know AI companies/bots respect `robots.txt`?
|
||||||
|
|
||||||
The short answer is that we don't. `robots.txt` is a well-established standard, but compliance is voluntary. There is no enforcement mechanism.
|
The short answer is that we don't. `robots.txt` is a well-established standard, but compliance is voluntary. There is no enforcement mechanism.
|
||||||
|
@ -36,16 +46,6 @@ That depends on your stack.
|
||||||
- Vercel
|
- Vercel
|
||||||
- [Block AI Bots Firewall Rule](https://vercel.com/templates/firewall/block-ai-bots-firewall-rule) by Vercel
|
- [Block AI Bots Firewall Rule](https://vercel.com/templates/firewall/block-ai-bots-firewall-rule) by Vercel
|
||||||
|
|
||||||
## Why should we block these crawlers?
|
|
||||||
|
|
||||||
They're extractive, confer no benefit to the creators of data they're ingesting and also have wide-ranging negative externalities.
|
|
||||||
|
|
||||||
**[How Tech Giants Cut Corners to Harvest Data for A.I.](https://www.nytimes.com/2024/04/06/technology/tech-giants-harvest-data-artificial-intelligence.html?unlocked_article_code=1.ik0.Ofja.L21c1wyW-0xj&ugrp=m)**
|
|
||||||
> OpenAI, Google and Meta ignored corporate policies, altered their own rules and discussed skirting copyright law as they sought online information to train their newest artificial intelligence systems.
|
|
||||||
|
|
||||||
**[How AI copyright lawsuits could make the whole industry go extinct](https://www.theverge.com/24062159/ai-copyright-fair-use-lawsuits-new-york-times-openai-chatgpt-decoder-podcast)**
|
|
||||||
> The New York Times' lawsuit against OpenAI is part of a broader, industry-shaking copyright challenge that could define the future of AI.
|
|
||||||
|
|
||||||
## How can I contribute?
|
## How can I contribute?
|
||||||
|
|
||||||
Open a pull request. It will be reviewed and acted upon appropriately. **We really appreciate contributions** — this is a community effort.
|
Open a pull request. It will be reviewed and acted upon appropriately. **We really appreciate contributions** — this is a community effort.
|
||||||
|
|
Loading…
Reference in a new issue