InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output Paper • 2407.03320 • Published Jul 3 • 93
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22 • 124