Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
sthenno
/
tempesthenno-ppo-ckpt40
like
4
Safetensors
nvidia/HelpSteer2
English
Chinese
qwen2
RLHF
PPO
custom-research
Eval Results
License:
apache-2.0
Model card
Files
Files and versions
Community
1
main
tempesthenno-ppo-ckpt40
Commit History
Adding Evaluation Results (
#1
)
915a7cd
verified
sthenno
commited on
17 days ago
Update README.md
c7e00f9
verified
sthenno
commited on
17 days ago
Update README.md
c707d0c
verified
sthenno
commited on
18 days ago
Upload folder using huggingface_hub
5ebb0a8
verified
sthenno
commited on
29 days ago
initial commit
578c1c8
verified
sthenno
commited on
29 days ago