7 12 91

Xie

Zhihui

https://zhxie.site/

zhxieml

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

upvoted a paper 19 days ago

Diving into Self-Evolving Training for Multimodal Reasoning

updated a Space 19 days ago

MMInstruction/VL-RewardBench

View all activity

Organizations

Zhihui's activity

upvoted a paper 10 days ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published 16 days ago • 78

upvoted a paper 19 days ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published 21 days ago • 42

upvoted a paper about 2 months ago

VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

Paper • 2411.17451 • Published Nov 26, 2024 • 10

upvoted a paper 4 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 61

upvoted 2 papers 6 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 160

Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models

Paper • 2404.12387 • Published Apr 18, 2024 • 38

upvoted a paper 7 months ago

Jailbreaking as a Reward Misspecification Problem

Paper • 2406.14393 • Published Jun 20, 2024 • 12

upvoted an article 7 months ago

Article

Putting RL back in RLHF

Jun 12, 2024

• 66

upvoted 2 papers 7 months ago

A Primer on the Inner Workings of Transformer-based Language Models

Paper • 2405.00208 • Published Apr 30, 2024 • 9

Calibrating Reasoning in Language Models with Internal Consistency

Paper • 2405.18711 • Published May 29, 2024 • 6

upvoted a collection 7 months ago

🔍 Daily Picks in Interpretability & Analysis of LMs

Collection

Outstanding research in interpretability and evaluation of language models, summarized • 93 items • Updated 5 days ago • 96

upvoted a paper about 1 year ago

Silkie: Preference Distillation for Large Visual Language Models

Paper • 2312.10665 • Published Dec 17, 2023 • 11