Xie

Zhihui

https://zhxie.site/

zhxieml

AI & ML interests

None yet

Zhihui's activity

upvoted a paper 10 days ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published 16 days ago • 78

upvoted a paper 19 days ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published 21 days ago • 42

updated a Space 19 days ago

🥇 VL RewardBench • Running

updated a dataset 19 days ago

MMInstruction/VL-RewardBench

Viewer • Updated 14 days ago • 1.25k • 274 • 4

New activity in MMInstruction/VL-RewardBench 19 days ago

fix preprocessing errors

#1 opened 19 days ago by Zhihui

liked a dataset 28 days ago

bigcode/bigcodebench

Viewer • Updated Sep 10, 2024 • 3.42k • 4.13k • 49

New activity in MMInstruction/VL-RewardBench about 1 month ago

Remove redundant fields

#4 opened about 1 month ago by Zhihui

Cleanup parquet data

#3 opened about 1 month ago by Zhihui

updated a dataset about 1 month ago

Zhihui/Vl-RewardBench

Viewer • Updated Dec 8, 2024 • 1.25k • 129

liked 2 datasets about 1 month ago

lmarena-ai/PPE-MBPP-Plus-Best-of-K

Viewer • Updated Oct 22, 2024 • 507 • 100 • 1

MMInstruction/VL-RewardBench

Viewer • Updated 14 days ago • 1.25k • 274 • 4

authored 3 papers about 2 months ago

Pretraining in Deep Reinforcement Learning: A Survey

Paper • 2211.03959 • Published Nov 8, 2022 • 1

VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment

Paper • 2410.09421 • Published Oct 12, 2024

VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

Paper • 2411.17451 • Published Nov 26, 2024 • 10

liked a Space about 2 months ago

🥇 VL RewardBench • Running

upvoted a paper about 2 months ago

VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

Paper • 2411.17451 • Published Nov 26, 2024 • 10

liked 2 datasets about 2 months ago

allenai/olmo-mix-1124

Viewer • Updated Dec 2, 2024 • 99.1M • 6.03k • 25

codeparrot/apps

Viewer • Updated Oct 20, 2022 • 20k • 4.1k • 136

liked 2 models 3 months ago

Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • Updated about 21 hours ago • 106k • 383

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 5.14M • 3.42k