Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
Tirtha Debnath
tdnathmlenthusiast
Follow
0 followers
Β·
3 following
https://github.com/darkangrycoder
AI & ML interests
None yet
Recent Activity
reacted
to
lewtun
's
post
with β€οΈ
10 days ago
Introducing OpenR1-Math-220k! https://huggingface.co/datasets/open-r1/OpenR1-Math-220k The community has been busy distilling DeepSeek-R1 from inference providers, but we decided to have a go at doing it ourselves from scratch πͺ Whatβs new compared to existing reasoning datasets? βΎ Based on https://huggingface.co/datasets/AI-MO/NuminaMath-1.5: we focus on math reasoning traces and generate answers for problems in NuminaMath 1.5, an improved version of the popular NuminaMath-CoT dataset. π³ 800k R1 reasoning traces: We generate two answers for 400k problems using DeepSeek R1. The filtered dataset contains 220k problems with correct reasoning traces. π 512 H100s running locally: Instead of relying on an API, we leverage vLLM and SGLang to run generations locally on our science cluster, generating 180k reasoning traces per day. β³ Automated filtering: We apply Math Verify to only retain problems with at least one correct answer. We also leverage Llama3.3-70B-Instruct as a judge to retrieve more correct examples (e.g for cases with malformed answers that canβt be verified with a rules-based parser) π We match the performance of DeepSeek-Distill-Qwen-7B by finetuning Qwen-7B-Math-Instruct on our dataset. π Read our blog post for all the nitty gritty details: https://huggingface.co/blog/open-r1/update-2
updated
a Space
4 months ago
tdnathmlenthusiast/German_TTS
updated
a Space
4 months ago
tdnathmlenthusiast/Technical_Interview_SpeechT5
View all activity
Organizations
tdnathmlenthusiast
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a Space
5 months ago
Running
312
312
Qwen2.5 72B Instruct
β‘
Generate responses in a chat with Qwen, a helpful assistant
liked
a Space
over 1 year ago
Sleeping
1
1
Online Course Categorize System
π