bosonai/higgs-audio-v2-generation-3B-base Text-to-Speech • 6B • Updated 19 days ago • 300k • 561
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated May 1 • 444k • 1.47k
moonshotai/Kimi-VL-A3B-Thinking-2506 Image-Text-to-Text • 16B • Updated 16 days ago • 36.8k • 264
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training Paper • 2506.05301 • Published Jun 5 • 55