Hugging Face
TY.Zheng
aaabiao
20 followers · 9 following
https://scholar.google.com/citations?user=Vq-VZnUAAAAJ&hl=zh-CN
Zheng0428
AI & ML interests
None yet
Recent Activity
updated a dataset 13 days ago: aaabiao/dapo_filter
published a dataset 13 days ago: aaabiao/dapo_filter
upvoted a paper 14 days ago: Agentic Reinforced Policy Optimization
Papers
25
arxiv:2507.07017
arxiv:2507.00432
arxiv:2504.05535
arxiv:2502.14739
models
62
aaabiao/qwen3_14b_think_32B_math_reject_sampling_150step_0706 • 15B • Updated Jul 5 • 3
aaabiao/qwen3_14b_no_think_32B_math_reject_sampling_150step_0706 • 15B • Updated Jul 5 • 3
aaabiao/qwen3_14b_think_32B_math_reject_sampling_150step • 15B • Updated Jul 2 • 3
aaabiao/qwen3_14b_no_think_32B_math_reject_sampling_150step • 15B • Updated Jul 2 • 3
aaabiao/qwen3_14b_distill_no_think_32b_5e5_150step_fix2 • 15B • Updated Jun 29 • 4
aaabiao/verl-8B-100step-v1 • 8B • Updated Jun 29 • 4
aaabiao/verl-4B-100step-v1 • 4B • Updated Jun 28 • 4
aaabiao/qwen3_14b_distill_no_think_32b_5e5_150step_fix • 15B • Updated Jun 28 • 6
aaabiao/verl-14B-60step-v1 • 15B • Updated Jun 27 • 4
aaabiao/verl-14B-120step-v1 • 15B • Updated Jun 27 • 4
datasets
11
aaabiao/dapo_filter • Preview • Updated 13 days ago • 15
aaabiao/data_bon8 • Viewer • Updated 28 days ago • 261k • 53
aaabiao/Transfer_Dataset • Viewer • Updated Jul 7 • 39.9k • 18
aaabiao/OpenThoughts2-1M-fiilter • Viewer • Updated Jun 5 • 497k • 6
aaabiao/neo-stage1 • Viewer • Updated May 4 • 1.55M • 4
aaabiao/neo-stage2 • Viewer • Updated May 4 • 231k • 3
aaabiao/RL-dataset • Preview • Updated Apr 16 • 35
aaabiao/RL-datasets • Updated Apr 9 • 1
aaabiao/Code_Data • Preview • Updated Jan 10 • 671
aaabiao/DAG • Updated Dec 29, 2024 • 34