Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
19
5
4
Hanbin Wang
hanbin
Follow
Lynncc6's profile picture
BryantMcGill's profile picture
junwux's profile picture
13 followers
·
3 following
https://wanghanbinpanda.github.io/
wanghanbinpanda
AI & ML interests
Code Intelligence and LLM Reasoning (Code, Math)
Recent Activity
authored
a paper
about 21 hours ago
Process Reinforcement through Implicit Rewards
updated
a dataset
about 24 hours ago
PRIME-RL/Eurus-2-RL-Data
updated
a dataset
about 24 hours ago
PRIME-RL/EurusPRM-Stage1-Data
View all activity
Articles
Process Reinforcement through Implicit Rewards
Jan 3
•
22
Organizations
hanbin
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
authored
a paper
about 21 hours ago
Process Reinforcement through Implicit Rewards
Paper
•
2502.01456
•
Published
1 day ago
•
44
updated
3 datasets
about 24 hours ago
PRIME-RL/Eurus-2-RL-Data
Viewer
•
Updated
about 24 hours ago
•
483k
•
730
•
24
PRIME-RL/EurusPRM-Stage1-Data
Viewer
•
Updated
about 24 hours ago
•
463k
•
131
•
4
PRIME-RL/Eurus-2-SFT-Data
Viewer
•
Updated
about 24 hours ago
•
230k
•
287
•
10
updated
4 models
about 24 hours ago
PRIME-RL/EurusPRM-Stage2
Updated
about 24 hours ago
•
721
•
6
PRIME-RL/Eurus-2-7B-PRIME
Text Generation
•
Updated
about 24 hours ago
•
2.25k
•
59
PRIME-RL/Eurus-2-7B-SFT
Updated
about 24 hours ago
•
7.67k
•
2
PRIME-RL/EurusPRM-Stage1
Updated
about 24 hours ago
•
182
•
4
updated
a Space
about 24 hours ago
Running
README
🏃
upvoted
a
paper
1 day ago
Process Reinforcement through Implicit Rewards
Paper
•
2502.01456
•
Published
1 day ago
•
44
commented
a paper
1 day ago
Process Reinforcement through Implicit Rewards
Paper
•
2502.01456
•
Published
1 day ago
•
44
•
1
New activity in
PRIME-RL/Eurus-2-RL-Data
8 days ago
some empty code ground truths (roughly 1k in train)
1
#3 opened 8 days ago by
rawsh
New activity in
PRIME-RL/Eurus-2-7B-PRIME
about 1 month ago
Evaluation
6
#1 opened about 1 month ago by
tugstugi
Add library_name and pipeline_tag
#2 opened about 1 month ago by
nielsr
upvoted
an
article
about 1 month ago
view article
Article
Process Reinforcement through Implicit Rewards
By
ganqu
•
Jan 3
•
22
liked
a model
about 1 month ago
PRIME-RL/Eurus-2-7B-PRIME
Text Generation
•
Updated
about 24 hours ago
•
2.25k
•
59
updated
2 datasets
about 1 month ago
PRIME-RL/Eurus-2-SFT-Data
Viewer
•
Updated
about 24 hours ago
•
230k
•
287
•
10
PRIME-RL/Eurus-2-RL-Data
Viewer
•
Updated
about 24 hours ago
•
483k
•
730
•
24
updated
2 models
about 1 month ago
PRIME-RL/Eurus-2-7B-PRIME
Text Generation
•
Updated
about 24 hours ago
•
2.25k
•
59
PRIME-RL/Eurus-2-7B-SFT
Updated
about 24 hours ago
•
7.67k
•
2
Load more