Hugging Face
Yoon Sangwoong
2 followers · 4 following
https://swyoon.github.io
WoongSSang
swyoon
AI & ML interests
Generative Modeling, Reinforcement Learning
Recent Activity
Updated a model 5 days ago: RegularizedSelfPlay/sppo_reversekl-0.1-Gemma-2-2B-IT-RSPO-Iter3
Published a model 5 days ago: RegularizedSelfPlay/sppo_reversekl-0.1-Gemma-2-2B-IT-RSPO-Iter3
Updated a model 5 days ago: RegularizedSelfPlay/sppo_reversekl-0.1-Gemma-2-2B-IT-RSPO-Iter2
Models (1)
Sangwoong/gemma2-2b-it-hh-dpo-helpful-step-8000 • Text Generation • 3B • Updated Mar 11 • 2
Datasets (0)
None public yet