MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models Paper • 2410.17637 • Published Oct 23, 2024
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models Paper • 2406.12042 • Published Jun 17, 2024
Custom Data Augmentation for low resource ASR using Bark and Retrieval-Based Voice Conversion Paper • 2311.14836 • Published Nov 24, 2023
End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec2 Paper • 2401.06183 • Published Jan 11, 2024
Transcription and translation of videos using fine-tuned XLSR Wav2Vec2 on custom dataset and mBART Paper • 2403.00212 • Published Mar 1, 2024