Update README.md
README.md CHANGED
@@ -5,7 +5,7 @@ license: apache-2.0
 
 # Model Card: LlavaOLMoBitnet1B
 
-Multimodal Large Language Models (MM-LLMs) have seen significant advancements in the last year, demonstrating impressive performance across tasks.
+Multimodal Large Language Models (MM-LLMs) have seen significant advancements in the last year, demonstrating impressive performance across tasks. However, to truly democratize AI, models must exhibit strong capabilities and be able to run efficiently on small compute footprints accessible to most. As part of this quest, we introduce LLaVaOLMoBitnet1B - the first Ternary Multimodal LLM capable of accepting Image(s)+Text inputs to produce coherent textual responses. The model is open-sourced along with weights and training scripts to encourage future research into ternary models. We also release a technical report highlighting the training process, the challenges associated with ternary models, and future opportunities.
 
 ## Paper Abstract
 