File size: 308 Bytes
6224cbc
 
b235af2
6224cbc
b235af2
 
1
2
3
4
5
6
7
---

license: llama2
pipeline_tag: image-text-to-text
---


This repository contains the Elva-Vicuna-7B model presented in [On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning](https://huggingface.co/papers/2406.11823).