13 139 693

Reza Sayar PRO

Reza2kn

AI & ML interests

None yet

Recent Activity

liked a model about 7 hours ago

WueNLP/centurio_aya

liked a model about 7 hours ago

WueNLP/centurio_qwen

liked a dataset about 7 hours ago

WueNLP/SMPQA

View all activity

Organizations

Reza2kn's activity

upvoted an article about 12 hours ago

Article

🐺🐦‍⬛ LLM Comparison/Test: Phi-4, Qwen2 VL 72B Instruct, Aya Expanse 32B in my updated MMLU-Pro CS benchmark

•

1 day ago

• 2

upvoted 2 papers 3 days ago

SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images

Paper • 2501.04689 • Published 4 days ago • 13

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 4 days ago • 66

upvoted 4 papers 4 days ago

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Paper • 2501.04001 • Published 5 days ago • 36

upvoted an article 4 days ago

Article

Synthetic Data Generation with FastData and Hugging Face

•

5 days ago

• 12

upvoted 3 papers 4 days ago

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Paper • 2412.21059 • Published 13 days ago • 18

Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction

Paper • 2501.03218 • Published 6 days ago • 30

Test-time Computing: from System-1 Thinking to System-2 Thinking

Paper • 2501.02497 • Published 7 days ago • 33

upvoted 2 papers 5 days ago

AutoPresent: Designing Structured Visuals from Scratch

Paper • 2501.00912 • Published 11 days ago • 8

TransPixar: Advancing Text-to-Video Generation with Transparency

Paper • 2501.03006 • Published 6 days ago • 19

upvoted a collection 5 days ago

Cosmos

Collection

The collection of Cosmos models • 31 items • Updated 1 day ago • 209

upvoted an article 7 days ago

Article

Process Reinforcement through Implicit Rewards

•

9 days ago

• 15

upvoted 4 papers 8 days ago

LTX-Video: Realtime Video Latent Diffusion

Paper • 2501.00103 • Published 13 days ago • 40

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published 12 days ago • 40

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 11 days ago • 91

3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation

Paper • 2412.13059 • Published 26 days ago • 1

upvoted an article 8 days ago

Article

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

•

9 days ago

• 29