Hugging Face
3
3
171
Jian Hu
chuyi777
1 follower · 1 following
https://hujian.website
hijkzzz
AI & ML interests
Reinforcement Learning
Recent Activity
upvoted a paper 18 days ago
ProcessBench: Identifying Process Errors in Mathematical Reasoning
updated a model 28 days ago
OpenRLHF/Llama-3-8b-rm-mixture
updated a model 28 days ago
OpenRLHF/Llama-2-7b-rm-anthropic_hh-lmsys-oasst-webgpt
Organizations
chuyi777's activity
liked a model about 2 months ago
O1-OPEN/OpenO1-LLama-8B-v0.1 • Updated Oct 8 • 601 • 14
liked 3 models 2 months ago
AI-MO/NuminaMath-7B-TIR • Text Generation • Updated Aug 14 • 2.79k • 321
Nexusflow/Athene-70B • Text Generation • Updated Nov 15 • 7.33k • 193
peiyi9979/mistral-7b-sft • Text Generation • Updated Jan 15 • 1.29k • 7
liked 2 datasets 2 months ago
nvidia/HelpSteer2 • Viewer • Updated 10 days ago • 21.4k • 14.1k • 390
GAIR/o1-journey • Viewer • Updated Oct 16 • 327 • 1.25k • 125
liked 2 models 3 months ago
peiyi9979/math-shepherd-mistral-7b-rl • Text Generation • Updated Jan 15 • 394 • 5
peiyi9979/math-shepherd-mistral-7b-prm • Text Generation • Updated Jan 15 • 19.8k • 39
liked a dataset 3 months ago
peiyi9979/Math-Shepherd • Viewer • Updated Jan 3 • 445k • 603 • 80
liked a model 3 months ago
Qwen/Qwen2.5-Math-RM-72B • Text Classification • Updated Oct 31 • 9.79k • 66
liked a dataset 4 months ago
Skywork/Skywork-Reward-Preference-80K-v0.1 • Viewer • Updated Oct 25 • 82k • 957 • 41
liked 3 models 4 months ago
ai21labs/AI21-Jamba-1.5-Large • Text Generation • Updated Sep 17 • 2.7k • 205
microsoft/Phi-3.5-vision-instruct • Image-Text-to-Text • Updated Sep 26 • 273k • 626
microsoft/Phi-3.5-mini-instruct • Text Generation • Updated Sep 18 • 507k • 723
liked a dataset 5 months ago
Birchlabs/openai-prm800k-stepwise-critic • Viewer • Updated Jun 3, 2023 • 1.09M • 214 • 43
liked a model 5 months ago
mistralai/Codestral-22B-v0.1 • Text Generation • Updated Jul 31 • 3.36M • 1.17k
liked a model 7 months ago
sfairXC/FsfairX-LLaMA3-RM-v0.1 • Text Classification • Updated Oct 14 • 7.56k • 52
liked a dataset 7 months ago
RLHFlow/prompt-collection-v0.1 • Viewer • Updated May 8 • 179k • 45 • 8
liked a model 7 months ago
RLHFlow/pair-preference-model-LLaMA3-8B • Text Generation • Updated Oct 14 • 4.8k • 38
liked a dataset 7 months ago
weqweasdas/preference_dataset_mixture2_and_safe_pku • Viewer • Updated Apr 29 • 555k • 41 • 10