32 75 92

Somshubra Majumdar

smajumdar94

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

liked a model 3 days ago

nvidia/Llama-3.1-Nemotron-Nano-8B-v1

liked a model 3 days ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1

View all activity

Organizations

smajumdar94's activity

upvoted a paper 2 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published 6 days ago • 42

liked 2 models 3 days ago

nvidia/Llama-3.1-Nemotron-Nano-8B-v1

Text Generation • Updated 10 days ago • 3.72k • 89

nvidia/Llama-3_3-Nemotron-Super-49B-v1

Text Generation • Updated 6 days ago • 16.7k • 197

liked a Space 3 days ago

Canary 1B Flash

🐤

Canary 1B Flash demo

upvoted an article 6 days ago

Article

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

8 days ago

• 29

liked a dataset 7 days ago

nvidia/Llama-Nemotron-Post-Training-Dataset-v1

Viewer • Updated 8 days ago • 15.2M • 5.32k • 224

liked a Space 8 days ago

263

Thera Arbitrary-Scale Super-Resolution

🔥

Enhance image quality with real-time super-resolution

upvoted a paper 9 days ago

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Paper • 2503.10460 • Published 13 days ago • 26

liked a model 12 days ago

sesame/csm-1b

Text-to-Speech • Updated 10 days ago • 37.7k • 1.64k

liked a model 14 days ago

nvidia/DeepSeek-R1-FP4

Text Generation • Updated 27 days ago • 11.4k • 225

liked a dataset 14 days ago

open-r1/codeforces

Viewer • Updated about 22 hours ago • 10k • 1.22k • 27

liked a model 15 days ago

RekaAI/reka-flash-3

Updated 13 days ago • 4.7k • 336

liked a dataset 18 days ago

deepmind/code_contests

Viewer • Updated Jun 11, 2023 • 4.04k • 10k • 159

liked a model 20 days ago

Qwen/QwQ-32B

Text Generation • Updated 15 days ago • 616k • • 2.52k

liked a Space 28 days ago

342

AI Deadlines

⚡

Schedule tasks efficiently using AI-generated deadlines

upvoted 3 papers about 1 month ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17 • 32

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 75

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published Feb 13 • 34

upvoted an article about 2 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.19k