8 8 10

Manli Shu

Manli

azshue

AI & ML interests

None yet

Recent Activity

new activity 21 days ago

Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5:how to finetune xgen-mm?

liked a dataset 3 months ago

Salesforce/ProVision-10M

updated a model 6 months ago

Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5

View all activity

Organizations

Manli's activity

New activity in Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5 21 days ago

how to finetune xgen-mm?

#5 opened 2 months ago by

usr256864

liked a dataset 3 months ago

Salesforce/ProVision-10M

Viewer • Updated Feb 3 • 24.5M • 1.01k • 15

updated a model 6 months ago

Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5

Image-Text-to-Text • Updated Feb 3 • 6.12k • 51

New activity in Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5 6 months ago

Dataset link doesn't work?

#1 opened 7 months ago by

dibmvt

Extremely high GPU requirements for both basic (demo.ipynb) and batch (batch_inference.ipynb) notebooks

#3 opened 7 months ago by

dwb2023

upvoted a paper 7 months ago

xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations

Paper • 2408.12590 • Published Aug 22, 2024 • 36

New activity in Salesforce/xgen-mm-phi3-mini-base-r-v1 7 months ago

Link model to paper

#1 opened 7 months ago by

nielsr

New activity in Salesforce/xgen-mm-phi3-mini-instruct-r-v1 7 months ago

Link model to paper

#12 opened 7 months ago by

nielsr

liked 4 models 7 months ago

authored a paper 7 months ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 99

New activity in Salesforce/xgen-mm-phi3-mini-base-r-v1.5 7 months ago

Upload examples.

#2 opened 7 months ago by

an-yan

Update README.md

#1 opened 7 months ago by

an-yan

upvoted a paper 7 months ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 99

upvoted a collection 8 months ago

🍃 MINT-1T

Collection

Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24, 2024 • 58

liked 2 datasets 8 months ago

mlfoundations/MINT-1T-HTML

Viewer • Updated Sep 21, 2024 • 623M • 283k • 82

TIGER-Lab/VisualWebInstruct-Seed

Viewer • Updated Feb 5 • 60.3k • 450 • 16

authored a paper 9 months ago

Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models

Paper • 2209.07511 • Published Sep 15, 2022