Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yoon's picture

Yoon

Sangwoong
WillBankes's profile picture timxiaohangt's profile picture
·
https://swyoon.github.io
  • WoongSSang
  • swyoon

AI & ML interests

Generative Modeling, Reinforcement Learning

Recent Activity

updated a model 5 days ago
RegularizedSelfPlay/sppo_reversekl-0.1-Gemma-2-2B-IT-RSPO-Iter3
published a model 5 days ago
RegularizedSelfPlay/sppo_reversekl-0.1-Gemma-2-2B-IT-RSPO-Iter3
updated a model 5 days ago
RegularizedSelfPlay/sppo_reversekl-0.1-Gemma-2-2B-IT-RSPO-Iter2
View all activity

Organizations

swyoontestorg's profile picture Regularized Self-Play's profile picture Robust-Decoding's profile picture Think Before Imitate's profile picture

models 1

Sangwoong/gemma2-2b-it-hh-dpo-helpful-step-8000

Text Generation • 3B • Updated Mar 11 • 2

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs