Rasmi PRO

rasmi

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

common-pile/comma-v0.1-1t

liked a dataset about 1 month ago

common-pile/caselaw_access_project

upvoted an article about 1 month ago

SmolLM3: smol, multilingual, long-context reasoner

View all activity

Organizations

None yet

liked a model about 1 month ago

common-pile/comma-v0.1-1t

7B • Updated Jun 6 • 1.21k • 24

liked a dataset about 1 month ago

common-pile/caselaw_access_project

Viewer • Updated Jun 6 • 5.52M • 7.94k • 191

upvoted an article about 1 month ago

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others •

Jul 8

• 627

liked a dataset 4 months ago

LanguageShades/BiasShades

Viewer • Updated May 3 • 728 • 107 • 18

liked 2 models 5 months ago

mistralai/Mistral-Small-3.1-24B-Base-2503

24B • Updated 20 days ago • 7.76k • 246

mistralai/Mistral-Small-3.1-24B-Instruct-2503

24B • Updated 20 days ago • 255k • 1.3k

liked a Space 6 months ago

3.06k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 6 months ago

stepfun-ai/GOT-OCR-2.0-hf

Image-Text-to-Text • 0.6B • Updated Jan 31 • 67.3k • 212

liked a model 7 months ago

microsoft/phi-4

Text Generation • 15B • Updated Feb 24 • 659k • • 2.13k

liked a model 8 months ago

answerdotai/ModernBERT-large

Fill-Mask • 0.4B • Updated Jan 15 • 126k • 419

liked 2 models 9 months ago

PleIAs/Pleias-1.2b-Preview

1B • Updated Dec 5, 2024 • 988 • 20

google/paligemma2-3b-pt-896

Image-Text-to-Text • 3B • Updated Dec 5, 2024 • 2.5k • 22

upvoted a collection 9 months ago

PaliGemma 2 Release

Collection

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated Jul 10 • 150

liked a Space 10 months ago

2.56k

F5-TTS

🗣

F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)

liked a model 12 months ago

microsoft/Phi-3.5-vision-instruct

Image-Text-to-Text • 4B • Updated Sep 26, 2024 • 586k • 702

liked 2 models about 1 year ago

google/gemma-2b

Text Generation • 3B • Updated Sep 27, 2024 • 236k • 1.06k

google/gemma-scope

Updated Aug 29, 2024 • 173

liked 2 datasets about 1 year ago

mlfoundations/MINT-1T-ArXiv

Viewer • Updated Sep 19, 2024 • 5.6M • 2.24k • 49

AI-MO/NuminaMath-CoT

Viewer • Updated Nov 25, 2024 • 860k • 4.39k • 472

upvoted a collection about 1 year ago

NuminaMath

Collection

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated Feb 10 • 78

Rasmi PRO

AI & ML interests

Recent Activity

Organizations

rasmi's activity

SmolLM3: smol, multilingual, long-context reasoner

The Ultra-Scale Playbook

F5-TTS