9 20 10

Wei Liu

PeterV09

https://vpeterv.github.io

AI & ML interests

Machine Learning, Natural Language Processing

Recent Activity

upvoted a paper 5 days ago

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

upvoted a paper 5 days ago

Cosmos World Foundation Model Platform for Physical AI

upvoted a paper 5 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

View all activity

Organizations

PeterV09's activity

upvoted 4 papers 5 days ago

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Paper • 2501.04001 • Published 5 days ago • 37

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published 6 days ago • 55

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 9 days ago • 72

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models

Paper • 2501.03124 • Published 6 days ago • 12

upvoted a paper 10 days ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published 16 days ago • 78

upvoted 2 papers 20 days ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published 21 days ago • 42

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published 21 days ago • 45

upvoted a paper 3 months ago

CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling

Paper • 2409.19291 • Published Sep 28, 2024 • 19

upvoted 2 papers 4 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 61

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

upvoted an article 6 months ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11, 2024

• 110

upvoted 3 papers 6 months ago

Learning to Refuse: Towards Mitigating Privacy Risks in LLMs

Paper • 2407.10058 • Published Jul 14, 2024 • 29

Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist

Paper • 2407.08733 • Published Jul 11, 2024 • 21

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 16

upvoted an article 6 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15, 2024

• 171

upvoted a paper 6 months ago

Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers

Paper • 2406.16747 • Published Jun 24, 2024 • 18

upvoted a paper about 1 year ago

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 41

upvoted a collection about 1 year ago

Deita

Collection

14 items • Updated May 20, 2024 • 10

upvoted 2 papers over 1 year ago

TableGPT: Towards Unifying Tables, Nature Language and Commands into One GPT

Paper • 2307.08674 • Published Jul 17, 2023 • 48

AlpaGasus: Training A Better Alpaca with Fewer Data

Paper • 2307.08701 • Published Jul 17, 2023 • 22