The bilingual English/Chinese Baichuan2-7B-Chat VLM trained via LORA for https://arxiv.org/abs/2406.11665.
The Chinese half of the training data used for multimodal alignment and visual instruction tuning is sampled from here.
- Downloads last month
- 12
Inference API (serverless) does not yet support model repos that contain custom code.