Introduction
MAmmoTH-VL2, the model trained with VisualWebInstruct.
Links
Citation
@article{visualwebinstruct,
title={VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search},
author = {Jia, Yiming and Li, Jiachen and Yue, Xiang and Li, Bo and Nie, Ping and Zou, Kai and Chen, Wenhu},
journal={arXiv preprint arXiv:2503.10582},
year={2025}
}
- Downloads last month
- 26
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.