@article{xie2024sonicvisionlm,
title={SonicVisionLM: Playing Sound with Vision Language Models},
author={Xie, Zhifeng and Yu, Shengye and Li, Mengtian and He, Qile and Chen, Chaofeng and Jiang, Yu-Gang},
journal={arXiv preprint arXiv:2401.04394},
year={2024}
}
