WavLLM: Towards Robust and Adaptive Speech Large Language Model • Paper 2404.00656 • Published Mar 31
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization • Paper 2404.09956 • Published Apr 15
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations • Paper 2006.11477 • Published Jun 20, 2020