UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks Paper • 2407.02158 • Published Jul 2 • 1
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 2 days ago • 30
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation Paper • 2411.07975 • Published Nov 12 • 27
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 12 days ago • 131
Large Action Models: From Inception to Implementation Paper • 2412.10047 • Published 12 days ago • 28
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 67 items • Updated Jul 3 • 87
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 27 days ago • 289
GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation Paper • 2411.08033 • Published Nov 12 • 22
VoxPopuli v2 Collection A collection of checkpoints from the second VoxPopuli release. • 35 items • Updated Jan 16 • 5
VoxPopuli Collection A collection of open-source artefacts (datasets + checkpoints) from the first VoxPopuli release. • 32 items • Updated Jan 16 • 4
Robust Wav2Vec 2.0 Collection A collection of "robust" Wav2Vec 2.0 checkpoints pre-trained on datasets from multiple domains. • 4 items • Updated Jan 16 • 3
XLSR Collection A collection of multilingual Wav2Vec 2.0 checkpoints pre-trained on 53 languages and fine-tuned for CTC speech recognition. • 12 items • Updated Jan 16 • 6
Wav2Vec 2.0 Collection A collection for the first release of Wav2Vec 2.0, a speech encoder that learns powerful representations from unlabelled audio data. • 8 items • Updated Jan 16 • 18
SeamlessM4T Collection SeamlessM4T is designed to provide high quality translation, allowing people from different linguistic communities to communicate effortlessly. • 9 items • Updated Jan 16 • 14