VLM2Vec

community

https://github.com/TIGER-AI-Lab/VLM2Vec

AI & ML interests

Multimodal Embeddings and Retrieval.

Recent Activity

ziyjiang updated a model 12 days ago

VLM2Vec/VLM2Vec-V2.0

MINGYISU authored a paper 14 days ago

Breaking the Batch Barrier (B3) of Contrastive Learning via Smart Batch Mining

MINGYISU authored a paper 14 days ago

VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents

View all activity

models 1

VLM2Vec/VLM2Vec-V2.0

Image-to-Text • Updated 12 days ago • 3.76k • 8

datasets 21

VLM2Vec/MomentSeeker

Viewer • Updated 29 days ago • 1.8k • 165

VLM2Vec/Charades-STA

Viewer • Updated 29 days ago • 727 • 132

VLM2Vec/QVHighlight

Viewer • Updated 29 days ago • 1.08k • 441

VLM2Vec/MMEB-V2

Updated Jun 13 • 237

VLM2Vec/Kinetics-700

Viewer • Updated May 31 • 1k • 397

VLM2Vec/ViDoRe_esg_reports_human_labeled_v2

Viewer • Updated May 31 • 1.72k • 6

VLM2Vec/ViDoRe_economics_reports_v2

Viewer • Updated May 30 • 1.42k • 7

VLM2Vec/ViDoRe_biomedical_lectures_v2_multilingual

Viewer • Updated May 30 • 3.74k • 7

VLM2Vec/ViDoRe_biomedical_lectures_v2

Viewer • Updated May 30 • 1.72k • 10

VLM2Vec/ViDoRe_esg_reports_v2_multilingual

Viewer • Updated May 30 • 2.68k • 5

View 21 datasets