4 6 13

Sarthak Thakur

sarthak247

AI & ML interests

None yet

Recent Activity

updated a model about 4 hours ago

sarthak247/Wan2.1-T2V-1.3B-nf4

published a model about 4 hours ago

sarthak247/Wan2.1-T2V-1.3B-nf4

new activity about 6 hours ago

Wan-AI/Wan2.1-T2V-1.3B:Wan 2.1 Ultra Advanced Gradio APP for - Works as low as 4GB VRAM - 1-Click Installers for Windows, RunPod, Massed Compute - Batch Processing - T2V - I2V - V2V

View all activity

Organizations

sarthak247's activity

updated a model about 4 hours ago

sarthak247/Wan2.1-T2V-1.3B-nf4

Text-to-Video • Updated about 4 hours ago

published a model about 4 hours ago

sarthak247/Wan2.1-T2V-1.3B-nf4

Text-to-Video • Updated about 4 hours ago

New activity in Wan-AI/Wan2.1-T2V-1.3B about 6 hours ago

Wan 2.1 Ultra Advanced Gradio APP for - Works as low as 4GB VRAM - 1-Click Installers for Windows, RunPod, Massed Compute - Batch Processing - T2V - I2V - V2V

#3 opened 2 days ago by

MonsterMMORPG

New activity in google/siglip2-base-patch16-224 about 15 hours ago

Error while loading processor: TypeError: expected str, bytes or os.PathLike object, not NoneType

#2 opened 6 days ago by

armamut

liked a model 3 days ago

microsoft/wham

Updated 7 days ago • 230

upvoted 5 papers 3 days ago

VLM^2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues

Paper • 2502.12084 • Published 10 days ago • 29

PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data

Paper • 2502.14397 • Published 7 days ago • 34

Mol-LLaMA: Towards General Understanding of Molecules in Large Molecular Language Model

Paper • 2502.13449 • Published 9 days ago • 42

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published 7 days ago • 88

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 7 days ago • 150

liked 6 models 3 days ago

updated a collection 4 days ago

Qwen2.5-3B-GRPO

Collection

Trained with unsloth on just 250 steps (resource constraints) on GSM8K to add reasoning abilities to Qwen2.5-3B (smaller model because resources) • 3 items • Updated 4 days ago

updated a model 4 days ago

sarthak247/qwen2.5-grpo-gsm8k-250steps-gguf

Updated 4 days ago • 62

published a model 4 days ago

sarthak247/qwen2.5-grpo-gsm8k-250steps-gguf

Updated 4 days ago • 62

updated a collection 4 days ago

Qwen2.5-3B-GRPO

Collection

Trained with unsloth on just 250 steps (resource constraints) on GSM8K to add reasoning abilities to Qwen2.5-3B (smaller model because resources) • 3 items • Updated 4 days ago