---
language:
- en
- zh
license: apache-2.0
tags:
- llava
- vlm
datasets:
- LinkSoul/Chinese-LLaVA-Vision-Instructions
---

The bilingual English/Chinese Baichuan2-7B-Chat VLM trained via LORA for https://arxiv.org/abs/2406.11665.

The Chinese half of the training data used for multimodal alignment and visual instruction tuning is sampled from [here](https://huggingface.co/datasets/LinkSoul/Chinese-LLaVA-Vision-Instructions).