Hanbin Wang

hanbin

https://wanghanbinpanda.github.io/

wanghanbinpanda

AI & ML interests

Code Intelligence and LLM Reasoning (Code, Math)

Recent Activity

updated a model about 5 hours ago

PRIME-RL/Eurus-2-7B-PRIME-Zero

published a model about 7 hours ago

PRIME-RL/Eurus-2-7B-PRIME-Zero

new activity 24 days ago

PRIME-RL/Eurus-2-7B-PRIME: real usage query

hanbin's activity

updated a model about 5 hours ago

PRIME-RL/Eurus-2-7B-PRIME-Zero

Text Generation • Updated about 5 hours ago • 1

published a model about 7 hours ago

PRIME-RL/Eurus-2-7B-PRIME-Zero

Text Generation • Updated about 5 hours ago • 1

New activity in PRIME-RL/Eurus-2-7B-PRIME 24 days ago

real usage query

#4 opened 24 days ago by asidaddy

authored a paper about 1 month ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 55

updated 3 datasets about 1 month ago

updated 4 models about 1 month ago

PRIME-RL/EurusPRM-Stage2

Updated 23 days ago • 6.51k • 6

PRIME-RL/Eurus-2-7B-PRIME

Text Generation • Updated 23 days ago • 734 • 60

PRIME-RL/Eurus-2-7B-SFT

Updated 23 days ago • 3.4k • 2

PRIME-RL/EurusPRM-Stage1

Updated 23 days ago • 6.4k • 4

updated a Space about 1 month ago

README

🏃

upvoted a paper about 1 month ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 55

commented on a paper about 1 month ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 55

New activity in PRIME-RL/Eurus-2-RL-Data about 2 months ago

some empty code ground truths (roughly 1k in train)

#3 opened about 2 months ago by rawsh

New activity in PRIME-RL/Eurus-2-7B-PRIME 2 months ago

Evaluation

#1 opened 2 months ago by tugstugi

Add library_name and pipeline_tag

#2 opened 2 months ago by nielsr

upvoted an article 2 months ago

Process Reinforcement through Implicit Rewards

Article • and 1 other • Jan 3 • 25

published an article 2 months ago

Process Reinforcement through Implicit Rewards

Article • and 1 other • Jan 3 • 25

liked a model 2 months ago

PRIME-RL/Eurus-2-7B-PRIME

Text Generation • Updated 23 days ago • 734 • 60