Demo of the Transformers implementation of ColPali
VLMEvalKit Eval Results in video understanding benchmark