metadata
license: cc-by-nc-4.0
library_name: transformers
pipeline_tag: text-generation
tags:
- VILA
- VLM
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
The code is release at https://github.com/Efficient-Large-Model/VILA. Welcome to have a try and share your feedbacks!