California Governor Gavin Newsom vetoed California’s proposed Safe and Secure Innovation for Frontier Artificial Intelligence Models Act in late September, fearing it would stifle “innovation.” [SB…
The other reason they don’t do it is that many models are trained on a large corpus of pirated texts, and documenting this would be a confession.
Not just in an ‘I scraped the New York Times without permission’ kind of way, but in an ‘I illegally downloaded a torrent containing bestsellers from the last 30 years’ kind of way.
Bestsellers? There used to be torrents of basically all releases. My provider blocks torrent sites and I don't use a VPN, so I'm not sure if people still do this, but it was possible to download basically all English-language books released in a certain period at once.
Occasionally I see this for music (weekly new tracks).