---
language:
- en
- zh
license: apache-2.0
tags:
- llava
- vlm
datasets:
- LinkSoul/Chinese-LLaVA-Vision-Instructions
---

A bilingual English/Chinese VLM based on Baichuan2-7B-Chat, trained via LoRA for https://arxiv.org/abs/2406.11665.

The Chinese half of the training data, used for both multimodal alignment and visual instruction tuning, is sampled from [LinkSoul/Chinese-LLaVA-Vision-Instructions](https://huggingface.co/datasets/LinkSoul/Chinese-LLaVA-Vision-Instructions).