Hugging Face
Marco Zocca
ocramz
0 followers · 4 following
https://unfoldml.com
ocramz_yo
ocramz
AI & ML interests
Program understanding, languages and compilers
Recent Activity
liked a model 23 days ago: hexgrad/Kokoro-82M
liked a model 29 days ago: deepseek-ai/DeepSeek-R1
reacted to onekq's post 29 days ago:
So DeepSeek hits the mainstream media. But it has been a star in our little cult for at least 6 months. Its meteoric success is not overnight, but two years in the making. To learn their history, just look at their 🤗 repo https://huggingface.co/deepseek-ai

* End of 2023, they launched the first model (pretrained by themselves) following Llama 2 architecture
* June 2024, v2 (MoE architecture) surpassed Gemini 1.5, but behind Mistral
* September, v2.5 surpassed GPT 4o mini
* December, v3 surpassed GPT 4o
* Now R1 surpassed o1

Most importantly, if you think DeepSeek success is singular and unrivaled, that's WRONG. The following models are also near or equal the o1 bar.

* Minimax-01
* Kimi k1.5
* Doubao 1.5 pro
Organizations
Papers (2)
arxiv:2305.06161
arxiv:2301.03988
models
None public yet
datasets
None public yet