cognitivecomputations/Dolphin3.0-R1-Mistral-24B Text Generation β’ Updated 3 days ago β’ 1.04k β’ 82
HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution Paper β’ 2501.10045 β’ Published 24 days ago β’ 9
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Paper β’ 2501.12380 β’ Published 20 days ago β’ 81
GSTAR: Gaussian Surface Tracking and Reconstruction Paper β’ 2501.10283 β’ Published 24 days ago β’ 5
mlx-community/Llama-3.2-11B-Vision-Instruct-abliterated Image-Text-to-Text β’ Updated Dec 16, 2024 β’ 356 β’ 6
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass Paper β’ 2501.13928 β’ Published 18 days ago β’ 16
view article Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... By srinivasbilla β’ 21 days ago β’ 60
view article Article Timm β€οΈ Transformers: Use any timm model with transformers 25 days ago β’ 39
Multimodal LLMs Can Reason about Aesthetics in Zero-Shot Paper β’ 2501.09012 β’ Published 26 days ago β’ 10