Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models Paper • 2404.13013 • Published Apr 19, 2024 • 30
view article Article A failed experiment: Infini-Attention, and why we should keep trying? Aug 14, 2024 • 57
view article Article Docmatix - a huge dataset for Document Visual Question Answering Jul 18, 2024 • 72
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper • 2307.09288 • Published Jul 18, 2023 • 243