A bilingual English/Chinese VLM built on Llama2-7B-Chat and trained via LoRA for https://arxiv.org/abs/2406.11665.

The Chinese half of the training data used for multimodal alignment and visual instruction tuning is sampled from here.
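
Below is a minimal loading sketch. The repo ships custom code (hence `trust_remote_code=True`), so the exact interface is defined by that code; the `AutoProcessor`/`AutoModelForCausalLM` entry points, the image path, and the prompt here are assumptions for illustration, not a confirmed API.

```python
# Hypothetical usage sketch -- the actual loading classes depend on the
# custom code in the repo; AutoProcessor/AutoModelForCausalLM is an assumption.
import torch
from PIL import Image
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "amitha/mllava-llama2-en-zh"

processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # FP16 tensors are listed in the repo metadata
    trust_remote_code=True,
)

image = Image.open("example.jpg")          # placeholder image path
prompt = "描述这张图片。"                    # "Describe this image." (Chinese)

inputs = processor(text=prompt, images=image, return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=128)
print(processor.batch_decode(output_ids, skip_special_tokens=True)[0])
```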

Model size: 7.06B params (Safetensors); tensor types: F32 and FP16.