83 134 438

Thomas Wolf PRO

thomwolf

https://thomwolf.io

AI & ML interests

NLP and open-source :-)

Recent Activity

upvoted a paper about 11 hours ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

liked a Space about 11 hours ago

Wan-AI/Wan2.1

liked a model about 11 hours ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

View all activity

Organizations

thomwolf's activity

upvoted a paper about 11 hours ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published 5 days ago • 57

liked a Space about 11 hours ago

1.27k

Wan2.1

💻

Wan: Open and Advanced Large-Scale Video Generative Models

liked 3 models about 11 hours ago

liked a model 1 day ago

sesame/csm-1b

Text-to-Speech • Updated 3 days ago • 13k • 1.28k

liked a model 2 days ago

Qwen/Qwen2.5-32B

Text Generation • Updated Sep 20, 2024 • 121k • 128

liked 2 datasets 2 days ago

open-r1/codeforces-cots

Viewer • Updated 2 days ago • 238k • 3.86k • 77

gaia-benchmark/GAIA

Updated Feb 13 • 9.31k • 269

liked a Space 3 days ago

516

Sesame CSM

🌱

Conversational speech generation

liked a dataset 5 days ago

open-r1/OpenR1-Math-220k

Viewer • Updated 29 days ago • 450k • 53.7k • 515

liked a model 6 days ago

open-r1/OlympicCoder-7B

Text Generation • Updated 2 days ago • 3.79k • 119

liked a dataset 7 days ago

facebook/natural_reasoning

Viewer • Updated 27 days ago • 1.15M • 12.9k • 427

posted an update 7 days ago

Post

2321

We've kept pushing our Open-R1 project, an open initiative to replicate and extend the techniques behind DeepSeek-R1.

And even we were mind-blown by the results we got with this latest model we're releasing: ⚡️OlympicCoder ( open-r1/OlympicCoder-7B and open-r1/OlympicCoder-32B)

It's beating Claude 3.7 on (competitive) programming –a domain Anthropic has been historically really strong at– and it's getting close to o1-mini/R1 on olympiad level coding with just 7B parameters!

And the best part is that we're open-sourcing all about its training dataset, the new IOI benchmark, and more in our Open-R1 progress report #3: https://huggingface.co/blog/open-r1/update-3

Datasets are are releasing:
- open-r1/codeforces
- open-r1/codeforces-cots
- open-r1/ioi
- open-r1/ioi-test-cases
- open-r1/ioi-sample-solutions
- open-r1/ioi-cots
- open-r1/ioi-2024-model-solutions