The public web and consent

July 1, 2024

Matt Birchler, writing about the state of the open web and LLMs scraping content without permission:

You could also choose to block my ad or to mess with my CSS. You could choose to read entirely in your RSS reader and never come to the site at all. You can save this to the read later service and read it on their site or in their app. You can download a local copy of anything on the site and do whatever you want with it. Search engines can index it and show my site to people looking stuff up on Google. And yes, LLMs can scrape my site to use it as food for their training.

I bet that everyone reading this was nodding along like, “yes, this is what’s so great about the open web!” up until that last one — then we probably had a split in opinions. […]