BOK-VQA: Bilingual outside Knowledge-Based Visual Question Answering via Graph Representation Pretraining Paper • 2401.06443 • Published Jan 12 • 2
X-LLaVA: Optimizing Bilingual Large Vision-Language Alignment Paper • 2403.11399 • Published Mar 18 • 6