thorirhrafn/gpt1B_reward_model2

Tags: PEFT, TensorBoard, Safetensors, trl, reward-trainer, Generated from Trainer
gpt1B_reward_model2 / runs
  • 1 contributor
History: 3 commits
Latest commit: thorirhrafn, "Training in progress, epoch 2", 74209d3 (verified), over 1 year ago
  • Apr26_19-47-10_gpu-2
    Training in progress, epoch 0 (over 1 year ago)
  • Apr26_19-49-14_gpu-2
    Training in progress, epoch 0 (over 1 year ago)
  • Apr26_19-51-33_gpu-2
    Training in progress, epoch 0 (over 1 year ago)
  • Apr26_19-53-32_gpu-2
    Training in progress, epoch 0 (over 1 year ago)
  • Apr26_19-56-04_gpu-2
    Training in progress, epoch 0 (over 1 year ago)
  • Apr26_20-02-35_gpu-2
    Training in progress, epoch 1 (over 1 year ago)
  • Apr26_20-06-30_gpu-2
    Training in progress, epoch 1 (over 1 year ago)
  • Apr26_20-08-32_gpu-2
    Training in progress, epoch 1 (over 1 year ago)
  • Apr28_17-47-26_gpu-5
    Training in progress, epoch 2 (over 1 year ago)
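
The run directory names listed above look like a month-day stamp, a time of day, and a hostname joined by underscores (for example Apr26_19-47-10_gpu-2). A minimal sketch of splitting such a name into its parts follows. The helper name parse_run_dir and the Run tuple are hypothetical, the names carry no year so a placeholder year is used only to make parsing well defined, and the month abbreviation is assumed to be English.

```python
from datetime import datetime
from typing import NamedTuple


class Run(NamedTuple):
    """Parts of a run directory name (hypothetical helper type)."""
    month: int
    day: int
    time: str
    host: str


def parse_run_dir(name: str) -> Run:
    # Split into at most three parts so underscores in the hostname survive.
    date_part, time_part, host = name.split("_", 2)
    # The directory name has no year; 2000 is a placeholder (assumption),
    # chosen because it is a leap year and accepts any valid month/day.
    stamp = datetime.strptime(
        f"2000 {date_part} {time_part}", "%Y %b%d %H-%M-%S"
    )
    return Run(stamp.month, stamp.day, stamp.strftime("%H:%M:%S"), host)


if __name__ == "__main__":
    for name in ["Apr26_19-47-10_gpu-2", "Apr28_17-47-26_gpu-5"]:
        print(parse_run_dir(name))
```

Sorting the parsed tuples by month, day, and time would order the runs chronologically within a single year, which is all the listing allows since the year is not recorded in the names.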