shi-labs
's Collections
Multimodal AI
updated
🔍
OLA-VLM
🐐
CuMo 7b Zero
✌️
VCoder
shi-labs/vcoder_ds_llava-v1.5-13b
Text Generation
•
Updated
•
10
•
4
shi-labs/CuMo-mistral-7b
Text Generation
•
Updated
•
39
•
15
shi-labs/CuMo-mixtral-8x7b
Text Generation
•
Updated
•
25
•
3
shi-labs/vcoder_llava-v1.5-7b
Text Generation
•
Updated
•
10
•
2
shi-labs/vcoder_ds_llava-v1.5-7b
Text Generation
•
Updated
•
12
shi-labs/vcoder_llava-v1.5-13b
Text Generation
•
Updated
•
13
•
4
VCoder: Versatile Vision Encoders for Multimodal Large Language Models
Paper
•
2312.14233
•
Published
•
16
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Paper
•
2405.05949
•
Published
•
2
shi-labs/OLA-VLM-CLIP-ViT-Phi3-4k-mini
Image-Text-to-Text
•
Updated
•
37
•
1
shi-labs/OLA-VLM-CLIP-ConvNeXT-Llama3-8b
Image-Text-to-Text
•
Updated
•
25
•
1
shi-labs/OLA-VLM-CLIP-ConvNeXT-Phi3-4k-mini
Image-Text-to-Text
•
Updated
•
15
•
1
shi-labs/vpt_OLA-VLM-CLIP-ConvNeXT-Llama3-8b
Image-Text-to-Text
•
Updated
•
69
•
2
shi-labs/OLA-VLM-CLIP-ViT-Llama3-8b
Image-Text-to-Text
•
Updated
•
34
shi-labs/pretrain_dsg_OLA-VLM-CLIP-ViT-Llama3-8b
Image-Text-to-Text
•
Updated
•
74