BB84 to

LocalLLaMA@sh.itjust.worksEnglish · 2 days ago

New open-weight 🐋 DeepSeek V3. 685B MoE. Beats Claude 3.5 Sonnet on Aider coding benchmark

2

31

New open-weight 🐋 DeepSeek V3. 685B MoE. Beats Claude 3.5 Sonnet on Aider coding benchmark

BB84 to

LocalLLaMA@sh.itjust.worksEnglish · 2 days ago

2

deepseek-ai/DeepSeek-V3 · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Absolutely humongous model. Mixture of 256 experts with 8 activated each time.

Aider leaderboard: The only model above 🐋 v3 here is ~~Open~~AI o1. DeepSeek is known to make amazing models and Aider rotates their benchmark over time, so it is unlikely that this is a train-on-benchmark situation.

Some more benchmarks: on Reddit.

You must log in or # to comment.

Chat

xodoh74984@lemmy.world
link
fedilink
English
arrow-up
2·
edit-2
4 hours ago
For the user whose VRAM knob goes to 11
- BB84OP
  link
  fedilink
  English
  arrow-up
  2·
  5 hours ago
  Someone managed to run it on a cluster of Mac Minis lol https://blog.exolabs.net/day-2/
toothbrush@lemmy.blahaj.zone
link
fedilink
English
arrow-up
1·
2 days ago
deleted by creator

LocalLLaMA@sh.itjust.works

localllama@sh.itjust.works

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !localllama@sh.itjust.works

Community to discuss about LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

7 users / day
38 users / week
115 users / month
262 users / 6 months
9 local subscribers
2.33K subscribers
230 Posts
932 Comments
Modlog