Wei Xiong
weqweasdas
15 followers · 2 following
https://weixiongust.github.io/WeiXiongUST/index.html
AI & ML interests
Machine learning, RLHF
Recent Activity
updated a dataset about 1 hour ago
selfcorrexp2/llama31_first_wrong_and_first_corr_regular_norr_20k
updated a dataset about 2 hours ago
selfcorrexp2/llama31_first_wrong_and_first_corr_regular_norr
updated a dataset about 2 hours ago
selfcorrexp2/10k_llama31_first_wrong_math_chat_format
weqweasdas's activity
Likes
liked a dataset about 1 month ago
RLHFlow/RLHFlow-SFT-Dataset-ver2 • Viewer • Updated Nov 2 • 2.32M • 77 • 4
liked a model about 1 month ago
RLHFlow/Llama3.1-8B-PRM-Mistral-Data • Text Generation • Updated Nov 9 • 1.98k • 6
liked a model 4 months ago
NCSOFT/Llama-3-OffsetBias-RM-8B • Text Classification • Updated Sep 6 • 857 • 21
liked a model 5 months ago
RLHFlow/LLaMA3-SFT • Text Generation • Updated Nov 3 • 5.75k • 8
liked 3 models 7 months ago
RLHFlow/LLaMA3-iterative-DPO-final • Text Generation • Updated Oct 14 • 8.22k • 40
RLHFlow/ArmoRM-Llama3-8B-v0.1 • Text Classification • Updated Sep 23 • 18k • 161
RLHFlow/pair-preference-model-LLaMA3-8B • Text Generation • Updated Oct 14 • 4.22k • 37
liked 5 models 8 months ago
Salesforce/LLaMA-3-8B-SFR-RM-R • Text Classification • Updated May 31 • 5 • 11
Salesforce/LLaMA-3-8B-SFR-SFT-R • Text Generation • Updated May 31 • 13 • 7
Salesforce/LLaMA-3-8B-SFR-Iterative-DPO-R • Text Generation • Updated Jun 12 • 209 • 76
sfairXC/FsfairX-LLaMA3-RM-v0.1 • Text Classification • Updated Oct 14 • 10.9k • 52
sfairXC/FsfairX-Zephyr-Chat-v0.1 • Text Generation • Updated Apr 24 • 23 • 8
liked a model 9 months ago
weqweasdas/RM-Mistral-7B • Text Classification • Updated Mar 31 • 1.64k • 22
liked a Space 9 months ago
📐 Reward Bench Leaderboard • Running • 300
liked 2 models 10 months ago
weqweasdas/RM-Gemma-7B • Text Classification • Updated Mar 22 • 319 • 8
weqweasdas/RM-Gemma-2B • Text Classification • Updated Mar 22 • 1.22k • 17
liked a model over 1 year ago
weqweasdas/hh_rlhf_rm_open_llama_3b • Text Classification • Updated Feb 25 • 224 • 16
liked a Space over 1 year ago
🔥 Robin 7b • Runtime error • 66