3 5 11

Ruohong Zhang

ruohongz

RifleZhang

AI & ML interests

LM pre-training

Recent Activity

updated a dataset 11 days ago

ShareGPTVideo/train_video_and_instruction

updated a dataset 11 days ago

Share4oReasoning/sft_data

liked a dataset 21 days ago

MMInstruction/VL-RewardBench

View all activity

Organizations

ruohongz's activity

updated 2 datasets 11 days ago

ShareGPTVideo/train_video_and_instruction

Updated 11 days ago • 1.46k • 20

Share4oReasoning/sft_data

Viewer • Updated 11 days ago • 404k • 145 • 1

liked a dataset 21 days ago

MMInstruction/VL-RewardBench

Viewer • Updated 1 day ago • 1.25k • 243 • 4

updated a dataset about 2 months ago

ShareGPTVideo/train_raw_video

Viewer • Updated Oct 31 • 64.1k • 141 • 1

updated 2 models about 2 months ago

ShareGPTVideo/LLaVA-Hound-DPO

Text Generation • Updated Oct 27 • 28 • 3

ShareGPTVideo/LLaVA-Hound-Pretrain

Text Generation • Updated Oct 27 • 14 • 1

New activity in ShareGPTVideo/LLaVA-Hound-SFT about 2 months ago

Add link to paper

#1 opened about 2 months ago by

nielsr

updated a model about 2 months ago

ShareGPTVideo/LLaVA-Hound-SFT

Image-Text-to-Text • Updated Oct 27 • 54 • 2

upvoted a paper 2 months ago

Scalable Ranked Preference Optimization for Text-to-Image Generation

Paper • 2410.18013 • Published Oct 23 • 14

authored a paper 2 months ago

Improve Vision Language Model Chain-of-thought Reasoning

Paper • 2410.16198 • Published Oct 21 • 22

upvoted a paper 2 months ago

Improve Vision Language Model Chain-of-thought Reasoning

Paper • 2410.16198 • Published Oct 21 • 22

commented a paper 2 months ago

Improve Vision Language Model Chain-of-thought Reasoning

Paper • 2410.16198 • Published Oct 21 • 22 •

authored 3 papers 2 months ago

SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents

Paper • 2310.11667 • Published Oct 18, 2023 • 2

A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest

Paper • 2311.10614 • Published Nov 17, 2023

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Paper • 2404.01258 • Published Apr 1 • 10

upvoted a paper 3 months ago

Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems

Paper • 2408.16293 • Published Aug 29 • 25

liked a dataset 3 months ago

MathLLMs/MathVision

Viewer • Updated Oct 10 • 3.34k • 5.42k • 34

upvoted a paper 4 months ago

Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29 • 92

liked a dataset 4 months ago

lmms-lab/LLaVA-OneVision-Data

Viewer • Updated Oct 22 • 3.72M • 11.9k • 149

New activity in ShareGPTVideo/test_raw_video_data 6 months ago

where can I find the dpo video?

#2 opened 6 months ago by

Wiselnn