If an organization runs a survey in 2024 on whether it should get into AI, then they’ve already bodged an LLM into the system and they’re seeing if they can get away with it. Proton Mail is a priva…
we appear to be the first to write up the outrage coherently too. much thanks to the illustrious @self
Mistral’s Mixtral-8x7B-Instruct-v0.1 produced copyrighted content on 22% of the prompts.
did you know that a lesser-known side effect of the infinite monkeys approach is that they will produce whole sections of copyright content abso-dupo-lutely by accident? wild, I know! totes coinkeedink!
I’d be glad to provide it once you’ve come to your senses and want to discuss things like an adult
jesus fucking christ you must be a fucking terrible person to work with
I'm too old to discuss against bad faith arguments.
Especially with people who won't read the information I provide them showing their initial information was wrong.
One is a company that has something to sell, the other an article with citations showing why it's not easy to determine what percentage of a data set is infringing on copyright, or whether exact reproduction via "fishing expedition" prompting is a useful metric to determine if unauthorized copyright was used in training.
The dumbest take though is attacking Mistral of all LLMs, even though it's on an Apache 2.0 license.
god these weird little fuckers’ ability to fill a thread with garbage is fucking notable isn’t it? something about loving LLMs makes you act like an LLM. how depressing for them.
To think that when sneer club/techtakes migrated to lemmy, I was pretty sure we would not be getting a lot of incidental traffic to the communities. Just about as wrong as you can be.