DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated 3 days ago • 144
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing Paper • 2508.10881 • Published 4 days ago • 48
Ovis2.5 Collection Our next-generation MLLMs for native-resolution vision and advanced reasoning • 4 items • Updated 3 days ago • 45
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale Paper • 2508.10711 • Published 4 days ago • 125
Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation Paper • 2508.07901 • Published 7 days ago • 38
Wan 2.2 FP8 AoT Collection optimized demos for Wan 2.2 14B models, using FP8 quantization + AoT compilation & community LoRAs for fast & high quality inference on ZeroGPU 💨 • 3 items • Updated 10 days ago • 2
Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 13 days ago • 460
CharacterShot: Controllable and Consistent 4D Character Animation Paper • 2508.07409 • Published 8 days ago • 36
Matrix-3D: Omnidirectional Explorable 3D World Generation Paper • 2508.08086 • Published 7 days ago • 67
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers Paper • 2507.12956 • Published Jul 17 • 24
Skywork-UniPic2 Collection Building Kontext Model with Online RL for Unified Multimodal Model • 8 items • Updated 5 days ago • 9
Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation Paper • 2508.03320 • Published 13 days ago • 59
LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation Paper • 2508.03694 • Published 12 days ago • 49
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Paper • 2508.02193 • Published 14 days ago • 123
Article Build an AI Shopping Assistant with Gradio MCP Servers By freddyaboulton • 18 days ago • 49
BANG: Dividing 3D Assets via Generative Exploded Dynamics Paper • 2507.21493 • Published 20 days ago • 61
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset Paper • 2507.21033 • Published 20 days ago • 20
Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 20 days ago • 155