
Deepseek's owners are being accused of stealing OpenAI's raw training data to build their model, essentially skipping an enormous part of the learning process.
Deepseek taking OpenAI's data isn't really the big concern in all this, though. It's the sheer amount of data being fed to it by the fuckwits that are actively using Deepseek that should be troubling folk:
( , Wed 29 Jan 2025, 12:58, Reply)

www.theregister.com/2025/01/30/deepseek_database_left_open/
( , Thu 30 Jan 2025, 11:20, Reply)

But I'm not sure that gives you moral permission to use it however you want and create completely derivative works without any credit?
Maybe it does and i haven't thought about it enough but i don't think any of the artists were very pleased.
( , Wed 29 Jan 2025, 14:03, Reply)

But people have been ripping off other people's stuff since the dawn of time, never mind the dawn of the internet, or the dawn of 'AI'. It's just a lot easier now.
( , Wed 29 Jan 2025, 14:06, Reply)

I think it's the scale that's taken people by surprise. Instead of copying a few paragraphs out of a book or a bit of music, it's the entire internet that's being hoovered up.
( , Wed 29 Jan 2025, 15:32, Reply)

( , Thu 30 Jan 2025, 10:37, Reply)