OpenAI destroyed a trove of books used to train AI models. The employees who collected the data are gone.
New legal filings reveal OpenAI deleted 2 massive datasets that it used to train a powerful AI model. The employees who built the datasets are gone.
"A corporation destroyed a trove of books!"
"That's terrible, what books were lost?"
"Oh, none. It was a digital collection."
"But people will be inconvenienced by losing the repository, right?"
"No, I mean a corporation deleted a dataset they made, trained from a metaphorical trove of books."
"I'm leaving."

Nice to see Lemmy users don't read the articles before posting stupid comments. The internet never changes.
The article is about a lawsuit by book authors.
OpenAI trained their models on copyrighted material.
OpenAI deleted the sources so the list of copyrighted materials is unknown.

It feels like a significant portion of Lemmy users have not met a boot they did not like the taste of.

Oh no.
Anyway.
The free stuff we put out on the internet got scraped by a robot?
Now we're gonna sue...Google? No it's fine when they do it.