Please add newer models (deepseek v3)

#645
by Someone2077 - opened

Going by the benchmarks, it's the best open-source model right now. It's a huge model, but only 37B parameters are active.

Benchmarks: [image: images (1).jpeg]

LiveBench: [image: DeepSeek V3 benchmark results on LiveBench]

It's a "thinking" model, so you'll also need a small addition to the UI to hide the thinking part.
Should I expect it to be available in HuggingChat any time soon, or is it too costly?
Either way, I appreciate what you guys are doing and wish you the best.
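
On the "hide the thinking part" request: here is a rough sketch of how a chat front end might strip the reasoning before display. It assumes the model wraps its reasoning in `<think>...</think>` tags, which the post does not state; the tag name and the `hide_thinking` helper are hypothetical and are not taken from HuggingChat's code.

```python
import re

# Assumption: reasoning is delimited by <think>...</think>. The tag name is
# hypothetical; adjust it to whatever the model actually emits.
THINK_BLOCK = re.compile(r"<think>.*?</think>\s*", re.DOTALL)


def hide_thinking(text: str) -> str:
    """Return the reply with the reasoning section removed."""
    # Drop every complete <think>...</think> block.
    visible = THINK_BLOCK.sub("", text)
    # While a reply is still streaming, the closing tag may not have arrived
    # yet, so hide everything from an unclosed opening tag onward.
    open_idx = visible.find("<think>")
    if open_idx != -1:
        visible = visible[:open_idx]
    return visible.strip()


if __name__ == "__main__":
    print(hide_thinking("<think>Let me work this out...</think>\nThe answer is 4."))
```

A real UI would more likely collapse the reasoning behind a toggle than discard it, but the detection step would look much the same.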

Someone2077 changed discussion title from Deepseek v3 to Please add newer models
Someone2077 changed discussion title from Please add newer models to Please add newer models (deepseek v3)

DeepSeek V3 is obviously a very big and spicy release, and it will certainly be very useful. BUT it's, like, SOOO large that I don't believe the HF team will host this 600B+ model.
They hosted Llama 3.1 405B for a while, then took it down because it simply consumed too many resources for a free service.
DeepSeek V3 is a MoE model though, so it generates a LOT quicker than Llama 3.1 405B.
If we are lucky, maybe we get a Q4 or even a Q8 quant of the model on HuggingChat, but even the Q4 would still eat over 300GB of VRAM, which is crazy expensive.
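
The numbers in this post can be sanity-checked with some back-of-the-envelope arithmetic. The sketch below uses only figures from the thread (600B as a lower bound on total parameters, 37B active, 405B for the dense Llama model). It counts weights only, ignoring KV cache and runtime overhead, and it treats Q4 and Q8 as exactly 4 and 8 bits per weight, which real quantization formats only approximate. The function names are made up for illustration.

```python
def weight_memory_gb(total_params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes), weights only."""
    # billions of parameters * bytes per parameter = billions of bytes = GB
    return total_params_billion * bits_per_weight / 8


def active_param_ratio(dense_params_billion: float, active_params_billion: float) -> float:
    """How many times more parameters a dense model touches per token."""
    return dense_params_billion / active_params_billion


if __name__ == "__main__":
    # 600B is the lower bound quoted in the thread, so these are minimums.
    print(f"Q4 weights: at least {weight_memory_gb(600, 4):.0f} GB")
    print(f"Q8 weights: at least {weight_memory_gb(600, 8):.0f} GB")
    # Per-token compute scales roughly with active parameters; actual
    # generation speed also depends on memory bandwidth and batching.
    print(f"Dense 405B vs 37B active: ~{active_param_ratio(405, 37):.1f}x")
```

So the "over 300GB" figure matches a 4-bit quant, while an 8-bit quant would need roughly twice that. All of the weights have to sit in memory even though only 37B are used per token, which is why a MoE model can be fast to run and still expensive to host.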

I don't think this is a meaningful choice. The biggest advantage of HuggingChat is its loose censorship. As we all know, the current main models, Qwen2.5 and Llama, are heavily censored. I don't want to see DeepSeek on this list.
