Marco Zocca

ocramz

https://unfoldml.com

AI & ML interests

Program understanding, languages and compilers

Recent Activity

liked a model 23 days ago

hexgrad/Kokoro-82M

liked a model 29 days ago

deepseek-ai/DeepSeek-R1

reacted to onekq's post with 👍 29 days ago

So 🐋DeepSeek🐋 hits the mainstream media. But it has been a star in our little cult for at least 6 months. Its meteoric success is not overnight, but two years in the making.

To learn their history, just look at their 🤗 repo https://huggingface.co/deepseek-ai

* End of 2023, they launched the first model (pretrained by themselves) following Llama 2 architecture
* June 2024, v2 (MoE architecture) surpassed Gemini 1.5, but behind Mistral
* September, v2.5 surpassed GPT 4o mini
* December, v3 surpassed GPT 4o
* Now R1 surpassed o1

Most importantly, if you think DeepSeek success is singular and unrivaled, that's WRONG. The following models are also near or equal the o1 bar.

* Minimax-01
* Kimi k1.5
* Doubao 1.5 pro

Organizations

Papers 2

arxiv:2305.06161

arxiv:2301.03988

Models

None public yet

Datasets

None public yet