Korean Datasets, Reward Models for RLHF
-
heegyu/KoSafeGuard-8b-0503
Text Generation • Updated • 51 • 5 -
heegyu/ko-reward-model-helpful-1.3b-v0.2
Text Classification • Updated • 16 -
heegyu/ko-reward-model-safety-1.3b-v0.2
Text Classification • Updated • 9 • 5 -
heegyu/ko-reward-model-helpful-roberta-large-v0.1
Text Classification • Updated • 17 • 1