Mixtures of Deep Neural Experts for Automated Speech Scoring Paper • 2106.12475 • Published Jun 23, 2021 • 2
MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages Paper • 2410.01036 • Published Oct 1 • 14
Training dynamic models using early exits for automatic speech recognition on resource-constrained devices Paper • 2309.09546 • Published Sep 18, 2023 • 1
Sequence-Level Knowledge Distillation for Class-Incremental End-to-End Spoken Language Understanding Paper • 2305.13899 • Published May 23, 2023
Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture of Adapters Paper • 2402.00828 • Published Feb 1
An Investigation of the Combination of Rehearsal and Knowledge Distillation in Continual Learning for Spoken Language Understanding Paper • 2211.08161 • Published Nov 15, 2022
Large Language Models Are Strong Audio-Visual Speech Recognition Learners Paper • 2409.12319 • Published Sep 18
Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers Paper • 2312.03694 • Published Dec 6, 2023 • 2