9 15 10

Wei Liu

PeterV09

https://vpeterv.github.io

AI & ML interests

Machine Learning, Natural Language Processing

Recent Activity

updated a collection about 12 hours ago

M-STAR

updated a collection about 12 hours ago

M-STAR

updated a model about 12 hours ago

hkust-nlp/mstar-prm-8b-v1.0

View all activity

Organizations

PeterV09's activity

updated a collection about 12 hours ago

M-STAR

Collection

Resources of M-STAR (Multimodal Self-Evolving Training for Reasoning) https://mstar-lmm.github.io/ • 2 items • Updated about 12 hours ago

updated a model about 12 hours ago

hkust-nlp/mstar-prm-8b-v1.0

Updated about 12 hours ago

updated a model about 13 hours ago

hkust-nlp/mstar-8b-v1.0

Updated about 13 hours ago • 2

upvoted a paper 1 day ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published 2 days ago • 27

commented a paper 1 day ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published 2 days ago • 27 •

upvoted a paper 1 day ago

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published 3 days ago • 32

upvoted a paper 3 months ago

CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling

Paper • 2409.19291 • Published Sep 28 • 19

updated a dataset 3 months ago

hkustnlpvlm/sub_MathV360KLabeled_Iter0

Preview • Updated Sep 29 • 5

commented a paper 3 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 60 •

upvoted 2 papers 3 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 60

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

updated a model 5 months ago

hkustnlpvlm/prmiter1_onlyllm_50k

Updated Jul 24 • 2

upvoted an article 5 months ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11

• 108

upvoted a paper 5 months ago

Learning to Refuse: Towards Mitigating Privacy Risks in LLMs

Paper • 2407.10058 • Published Jul 14 • 29

authored a paper 6 months ago

Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist

Paper • 2407.08733 • Published Jul 11 • 20

upvoted a paper 6 months ago

Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist

Paper • 2407.08733 • Published Jul 11 • 20

commented a paper 6 months ago

Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist

Paper • 2407.08733 • Published Jul 11 • 20 •

authored a paper 6 months ago

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 16

upvoted a paper 6 months ago

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 16