Now in 5 languages!
https://huggingface.co/papers/2501.03006
Gaze detection using Moondream
Audio Conditioned LipSync with Latent Diffusion Models
Create videos with FFMPEG + Qwen2.5-Coder
Build datasets using natural language
Scalable and Versatile 3D Generation from images