Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens • Paper • 2503.01710 • Published 22 days ago • 5
microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • Updated 4 days ago • 755k • 1.22k