Hellfire103@lemmy.ca to Not The Onion@lemmy.world · English · 2 days ago
OpenAI Furious DeepSeek Might Have Stolen All the Data OpenAI Stole From Us
www.404media.co
106 comments
Avid Amoeba@lemmy.ca · English · 2 days ago
Is there evidence that DeepSeek is an OpenAI distillate other than OpenAI and Co’s protestations?
brucethemoose@lemmy.world · English · edited 2 days ago
It’s literally impossible. I tried to explain it here: https://lemmy.world/comment/14763233
But the short version is that OpenAI doesn’t even offer access to the data you need for a “distillation,” as the term is used in the LLM community.
Of course there’s some OpenAI data in the base model, but that’s partially because it’s splattered all over the internet now.
morrowind@lemmy.ml · English · 2 days ago
Not a distillate, they just trained on the outputs of OpenAI.
Thank you 🙏