Hanze Dong

hendrydong

AI & ML interests

None yet

Recent Activity

updated a model 1 day ago
hendrydong/llama3b-r0-step220
published a model 1 day ago
hendrydong/llama3b-r0-step220
updated a model 1 day ago
hendrydong/llama3b-r0-step200
View all activity

Organizations

Salesforce's profile picture OptimalScale's profile picture reward modeling's profile picture raft_study's profile picture FsfairX's profile picture RLHFlow's profile picture

hendrydong's activity

Update README.md

#1 opened 2 months ago by
hendrydong
New activity in Salesforce/LLaMA-3-8B-SFR-RM-R 2 months ago

Update README.md

#1 opened 2 months ago by
hendrydong
New activity in Salesforce/LLaMA-3-8B-SFR-SFT-R 2 months ago

Update README.md

#1 opened 2 months ago by
hendrydong
New activity in Salesforce/LLaMA-3-8B-SFR-Iterative-DPO-R 2 months ago

Update README.md

#2 opened 2 months ago by
hendrydong
New activity in RLHFlow/LLaMA3.2-1B-SFT 4 months ago
New activity in sfairXC/FsfairX-LLaMA3-RM-v0.1 5 months ago

Update README.md

#4 opened 11 months ago by
johnowhitaker

Update README.md

#6 opened 5 months ago by
Haoxiang-Wang
New activity in sfairXC/FsfairX-LLaMA3-RM-v0.1 11 months ago

Training details?

1
#2 opened 11 months ago by
MicPie
New activity in microsoft/phi-2 over 1 year ago