Spaces:

huggingchat
/

chat-ui

Running

App Files Files Community

754

Please add newer models (deepseek v3)

#645

by Someone2077 - opened Dec 28, 2024

Discussion

Someone2077

Dec 28, 2024

•

edited Jan 10

Going by the benchmarks it's the best open-source model as of rn. It's a huge model but the active parameters are just 37B.

Benchmarks:

Live bench:

It's a "thinking" model so you'll also need a little addition to the UI to hide the thinking part.
Should I expect it to be available in hugging chat any time soon or is it too costly?
Either ways, I appreciate what you guys are doing and wish you the best.

Someone2077 changed discussion title from Deepseek v3 to Please add newer models Dec 29, 2024

Someone2077 changed discussion title from Please add newer models to Please add newer models (deepseek v3) Dec 29, 2024

Smorty100

Dec 29, 2024

Deepseek V3 is obviously a very big and spicy release. It will Certainly be very useful. BUT! It's - like - SOOO large, I don't believe that the HF team will host this 600B+ model.
They hosted Llama3.1 405B for a while, after which they took it down, since it simply consumed too many resources while also being a free service.
Deepseek V3 is a MoE model though, so it generates a LOT quicker than Llama 405B.
If we are lucky, maybe we get a Q4 quant or maybe even Q8 quant of the model on HuggingChat, but that would still eat over 300GB of VRAM, which is crazy expensive.

Infranta

Jan 8

•

edited Jan 8

I don't think this is a meaningful choice. The biggest advantage of Huggingchat is the loose censorship. As we all know, the current main models Qwen2.5 and Llama are suffocated censored models. I don't want to see DeepSeek on this list.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment