VLM2Vec

community
https://github.com/TIGER-AI-Lab/VLM2Vec

AI & ML interests

Multimodal Embeddings and Retrieval.

Recent Activity

ziyjiang  updated a model 12 days ago
VLM2Vec/VLM2Vec-V2.0
MINGYISU  authored a paper 14 days ago
Breaking the Batch Barrier (B3) of Contrastive Learning via Smart Batch Mining
MINGYISU  authored a paper 14 days ago
VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents

Team members: Xuan "Billy" Zhang, Rui, Ziyan Jiang, Xinyi Yang, Liu, MINGYI SU

models 1

VLM2Vec/VLM2Vec-V2.0

Image-to-Text • Updated 12 days ago • 3.76k • 8

datasets 21

VLM2Vec/MomentSeeker

Viewer • Updated 29 days ago • 1.8k • 165

VLM2Vec/Charades-STA

Viewer • Updated 29 days ago • 727 • 132

VLM2Vec/QVHighlight

Viewer • Updated 29 days ago • 1.08k • 441

VLM2Vec/MMEB-V2

Updated Jun 13 • 237

VLM2Vec/Kinetics-700

Viewer • Updated May 31 • 1k • 397

VLM2Vec/ViDoRe_esg_reports_human_labeled_v2

Viewer • Updated May 31 • 1.72k • 6

VLM2Vec/ViDoRe_economics_reports_v2

Viewer • Updated May 30 • 1.42k • 7

VLM2Vec/ViDoRe_biomedical_lectures_v2_multilingual

Viewer • Updated May 30 • 3.74k • 7

VLM2Vec/ViDoRe_biomedical_lectures_v2

Viewer • Updated May 30 • 1.72k • 10

VLM2Vec/ViDoRe_esg_reports_v2_multilingual

Viewer • Updated May 30 • 2.68k • 5