view article Article 🐺🐦⬛ LLM Comparison/Test: Phi-4, Qwen2 VL 72B Instruct, Aya Expanse 32B in my updated MMLU-Pro CS benchmark By wolfram • 1 day ago • 2
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images Paper • 2501.04689 • Published 4 days ago • 13
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 4 days ago • 66
Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model Paper • 2501.02790 • Published 6 days ago • 8
Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers Paper • 2501.03931 • Published 5 days ago • 11
PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides Paper • 2501.03936 • Published 5 days ago • 16
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Paper • 2501.04001 • Published 5 days ago • 36
view article Article Synthetic Data Generation with FastData and Hugging Face By asoria • 5 days ago • 12
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation Paper • 2412.21059 • Published 13 days ago • 18
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction Paper • 2501.03218 • Published 6 days ago • 30
Test-time Computing: from System-1 Thinking to System-2 Thinking Paper • 2501.02497 • Published 7 days ago • 33
AutoPresent: Designing Structured Visuals from Scratch Paper • 2501.00912 • Published 11 days ago • 8
TransPixar: Advancing Text-to-Video Generation with Transparency Paper • 2501.03006 • Published 6 days ago • 19
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Paper • 2501.00599 • Published 12 days ago • 40
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published 11 days ago • 91
3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation Paper • 2412.13059 • Published 26 days ago • 1
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • 9 days ago • 29