From ccb2e6bc593eafd3ff668544e57ec0762497c29f Mon Sep 17 00:00:00 2001 From: Cory Dransfeldt Date: Tue, 9 Apr 2024 12:51:11 -0700 Subject: [PATCH] chore: post --- ...data-collection-should-always-be-opt-in.md | 24 +++++++++++++++++++ 1 file changed, 24 insertions(+) create mode 100644 src/posts/2024/data-collection-should-always-be-opt-in.md diff --git a/src/posts/2024/data-collection-should-always-be-opt-in.md b/src/posts/2024/data-collection-should-always-be-opt-in.md new file mode 100644 index 00000000..da7b381f --- /dev/null +++ b/src/posts/2024/data-collection-should-always-be-opt-in.md @@ -0,0 +1,24 @@ +--- +date: '2024-04-09T13:00-08:00' +title: 'Data collection should always be opt in' +description: "If you're offering a service online, you should only collect data from users that is strictly required to operate service. I don't care what you're building." +tags: ['tech', 'AI', 'privacy'] +--- +If you're offering a service online, you should *only* collect data from users that is strictly required to operate service. I don't care what you're building. + +If you need my email to log me in, fine — don't send me anything I didn't ask for[^1]. And do not use it to try and sell me anything. I want utility, I don't want marketing[^2]. + +This has all been the pattern for tech for far too long — launch a service for "free", scale it, abuse your users, exit. It was done with social media (guess who got to deal with all the negative externalities?), we've pivoted from being ardently opposed to scraping data to repositioning the industry around scraping the entire internet. + +If you want access to "public" data it is incumbent upon you to obtain consent of the publisher — cite them, compensate them, don't attempt to trample them. Don't use it for your own benefit and ask for forgiveness later. If you're hosting someone's blog, they're likely paying you to do so — don't change your terms and send their data off to some parasitic AI partner. + +We need a reasonable baseline of respect for user privacy and user data. We need more data minimization and less collection. Holding data should be viewed as a tremendous responsibility and improper access and use of it should expose the holder to liability. + +If you're collecting data you shouldn't reasonably need, then you should be greeted with apprehension, resistance, ad blockers and mitigation efforts. + +Respect your users, provide value and quit grasping at data you don't need and aren't entitled to. + +{% render "partials/banners/github.liquid", url: 'https://github.com/ai-robots-txt/ai.robots.txt' %} + +[^1]: I'm looking at you LinkedIn — all those switches are hell, not agency. +[^2]: Don't argue that they can be the same thing. Stop. \ No newline at end of file