bczhou/tiny-llava-v1-hf

bczhou committed on Jan 12

Commit a359739 • 1 Parent(s): 8d3105c

Update README.md

Files changed (1)

README.md +2 -0

@@ -8,6 +8,8 @@ language:
 library_name: transformers
 ---
+# WORK IN PROGRESS
 ## Model type
 TinyLLaVA, a tiny model (1.4B) trained using the exact training recipe of [LLaVA-1.5](https://github.com/haotian-liu/LLaVA).
 We trained our TinyLLaVA using [TinyLlama](https://huggingface.co/PY007/TinyLlama-1.1B-Chat-v0.3) as our LLM backbone, and [clip-vit-large-patch14-336](https://huggingface.co/openai/clip-vit-large-patch14-336) as our vision backbone.