Yoon's picture

Yoon

Sangwoong

·

https://swyoon.github.io

AI & ML interests

Generative Modeling, Reinforcement Learning

Recent Activity

updated a model 5 days ago

RegularizedSelfPlay/sppo_reversekl-0.1-Gemma-2-2B-IT-RSPO-Iter3

published a model 5 days ago

RegularizedSelfPlay/sppo_reversekl-0.1-Gemma-2-2B-IT-RSPO-Iter3

updated a model 5 days ago

RegularizedSelfPlay/sppo_reversekl-0.1-Gemma-2-2B-IT-RSPO-Iter2

View all activity

Organizations

models 1

Sangwoong/gemma2-2b-it-hh-dpo-helpful-step-8000

Text Generation • 3B • Updated Mar 11 • 2

datasets 0

None public yet