LLaDA
Large Language Diffusion Models
Unified Framework for Generalized Video Face Restoration
Scalable and Versatile 3D Generation from Images
Find similar images from a collection
Detect and annotate poses in images and videos
FitDiT is a high-fidelity virtual try-on model.
Upgraded to v1.0!
Gaze detection using Moondream
Extract clothing from images using a mask
Execute commands from environment
A demo of Indic Parler-TTS
Generate anime-style multi-view images from text
Create top-quality 3D (.GLB) models from text or images
Optical illusions and style transfer with FLUX