Deepseek Database Exposed

TheMachineStops@discuss.tchncs.de · 24 hours ago (edited)

monotremata@lemmy.ca · 6 hours ago

I think it’s reasonably likely. There was a research paper about how to do basically that a couple of years ago. If you need a basic LLM trained on a specialized form of input and output, getting the expensive existing LLMs to generate that text for you is pretty efficient/inexpensive, so it’s a reasonable way to get a baseline model. Then you can add stuff like chain-of-thought reasoning and mixture of experts to improve the performance back up to where you need it. It’s not going to be a way to push the state of the art forward, but it’s sure a cheap way to catch up to models that have done that pushing.
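
To make the "get the expensive LLM to generate your training text" step concrete, here is a rough sketch of what collecting that data could look like. Everything named here is hypothetical: `build_distillation_set`, `to_jsonl`, and `fake_teacher` are made up for illustration, and `fake_teacher` is just a stand-in for whatever call you would actually make to an existing model. Nothing here is claimed to be what Deepseek or that paper did.

```python
import json
from typing import Callable, Dict, Iterable, List


def build_distillation_set(
    prompts: Iterable[str],
    teacher: Callable[[str], str],
) -> List[Dict[str, str]]:
    """Ask the teacher model for a completion to each prompt and keep the pairs."""
    records = []
    for prompt in prompts:
        completion = teacher(prompt)
        # Skip empty or whitespace-only answers so they don't pollute the set.
        if not completion.strip():
            continue
        records.append({"prompt": prompt, "completion": completion})
    return records


def to_jsonl(records: List[Dict[str, str]]) -> str:
    """Serialize the pairs as JSON Lines, one training example per line."""
    return "\n".join(json.dumps(r, ensure_ascii=False) for r in records)


def fake_teacher(prompt: str) -> str:
    """Hypothetical stand-in for a call to an existing, expensive LLM."""
    return f"[teacher answer to: {prompt}]"
```

You would then fine-tune the smaller model on those prompt/completion pairs; that part depends entirely on the training stack, so it is left out here.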
