Generate speech from text using a simplified TTS demo
Towards Unified Music Emotion Recognition across Dimensional