
Wei Xiong

weqweasdas

https://weixiongust.github.io/WeiXiongUST/index.html

AI & ML interests

Machine learning, RLHF

Recent Activity

updated a dataset about 18 hours ago

selfcorrexp/distill_40koldc2r_120kw_84kcorr

published a dataset about 18 hours ago

selfcorrexp/distill_40koldc2r_120kw_84kcorr

updated a dataset about 18 hours ago

selfcorrexp/distill_0kc2r_120kw_84kcorr

weqweasdas's activity

liked a model 11 days ago

RLHFlow/Llama3.1-8B-PRM-Deepseek-Data

Text Generation • Updated Nov 9, 2024 • 21.5k • 34

liked a dataset 2 months ago

RLHFlow/RLHFlow-SFT-Dataset-ver2

Viewer • Updated Nov 2, 2024 • 2.32M • 74 • 5

liked a model 3 months ago

RLHFlow/Llama3.1-8B-PRM-Mistral-Data

Text Generation • Updated Nov 9, 2024 • 1.4k • 8

liked a model 5 months ago

NCSOFT/Llama-3-OffsetBias-RM-8B

Text Classification • Updated Sep 6, 2024 • 4.69k • 22

liked a model 6 months ago

RLHFlow/LLaMA3-SFT

Text Generation • Updated Nov 3, 2024 • 9.15k • 9

liked 3 models 8 months ago

liked 5 models 9 months ago

Salesforce/LLaMA-3-8B-SFR-RM-R

Text Classification • Updated 6 days ago • 22 • 11

Salesforce/LLaMA-3-8B-SFR-SFT-R

Text Generation • Updated 6 days ago • 33 • 8

Salesforce/LLaMA-3-8B-SFR-Iterative-DPO-R

Text Generation • Updated 6 days ago • 101 • 78

sfairXC/FsfairX-LLaMA3-RM-v0.1

Text Classification • Updated Oct 14, 2024 • 5.98k • 54

sfairXC/FsfairX-Zephyr-Chat-v0.1

Text Generation • Updated Apr 24, 2024 • 58 • 8

liked a model 10 months ago

weqweasdas/RM-Mistral-7B

Text Classification • Updated Mar 31, 2024 • 1.41k • 22

liked a Space 10 months ago

📐 Reward Bench Leaderboard

Running • 321

liked 2 models 11 months ago

weqweasdas/RM-Gemma-7B

Text Classification • Updated Mar 22, 2024 • 83 • 8

weqweasdas/RM-Gemma-2B

Text Classification • Updated Mar 22, 2024 • 6.12k • 22

liked a model over 1 year ago

weqweasdas/hh_rlhf_rm_open_llama_3b

Text Classification • Updated Feb 25, 2024 • 427 • 17

liked a Space almost 2 years ago

Runtime error

🔥