MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
Paper • 2412.05237
Thanks!
Thanks a lot! This is exactly what I need right now! I am looking for GPUs to run the web demo and to fine-tune other base models. Are there any links where I can apply? Thanks again!
Thank you for the promotion!