Spaces:
Running
on
Zero
A newer version of the Gradio SDK is available:
5.34.2
title: SonicVerse
emoji: πΌ
colorFrom: purple
colorTo: red
sdk: gradio
sdk_version: 5.25.2
app_file: app.py
pinned: false
πΌ SonicVerse
An interactive demo for SonicVerse, a music captioning model, allowing users to input audio and generate a natural language caption that includes a general description of the music as well as music features such as key, instruments, genre, mood / theme, vocals gender.
The demo supports both short (10s) and long (up to 1 minute) audio inputs.
π Demo
Check out the live Space here:
π Samples
Short captions and long chained LLM-generated captions: β‘οΈ Samples page
π¦ Features
β Upload a 10 second music clip and get a caption
β Upload a long music clip (upto 1 minute for successful demo) to get a long detailed caption for the whole music clip.
β Captions include musical attributes (key, instruments, tempo, etc.)
β οΈ You can upload audio of any length, but due to compute limitations on Hugging Face Spaces, we recommend uploading clips under 30 seconds unless you have a Hugging Face Pro account or run the app locally.
π οΈ How to Run Locally
# Clone the repo
git clone https://github.com/AMAAI-Lab/SonicVerse
cd SonicVerse
# Install dependencies
pip install -r requirements.txt
# Alternatively, set up conda environment
conda env create -f environment.yml
conda activate sonicverse
# Run the app
python app.py
π‘ Usage
To use the app:
- Select audio clip to input
- Click the Generate button.
- See the modelβs output below.
π Citation
If you use SonicVerse in your work, please cite our paper:
SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning Anuradha Chopra, Abhinaba Roy, Dorien Herremans Accepted to AIMC 2025
@article{chopra2025sonicverse,
title={SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning},
author={Chopra, Anuradha and Roy, Abhinaba and Herremans, Dorien},
journal={Proceedings of the 6th Conference on AI Music Creativity (AIMC 2025)},
year={2025},
address={Brussels, Belgium},
month={September},
url={https://arxiv.org/abs/2506.15154},
}
Read the paper here: arXiv:2506.15154 DOI: 10.48550/arXiv.2506.15154