nvidia/canary-1b-flash Automatic Speech Recognition • 0.8B • Updated about 9 hours ago • 53.4k • 241
nvidia/parakeet-tdt-0.6b-v2 Automatic Speech Recognition • Updated about 9 hours ago • 465k • 1.29k
speechbrain/lang-id-voxlingua107-ecapa Audio Classification • Updated Nov 27, 2024 • 122k • 124
bosonai/higgs-audio-v2-generation-3B-base Text-to-Speech • 6B • Updated 20 days ago • 300k • 562
Encoders vs Decoders: the Ettin Suite Collection A collection of SOTA, open-data, paired encoder-only and decoder only models ranging from 17M params to 1B. See the paper at https://arxiv.org/abs/250 • 32 items • Updated Jul 16 • 18
GLiNER-X Collection The Multilingual Named Entity Recognition (NER) model which is capable of identifying any entity type. • 6 items • Updated Jun 24 • 20
view article Article Transformers backend integration in SGLang By marcsun13 and 4 others • Jun 23 • 52