Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
vincentmin
/
llama-2-13b-reward-oasst1
like
0
Text Classification
PEFT
TensorBoard
tasksource/oasst1_pairwise_rlhf_reward
Generated from Trainer
trl
Model card
Files
Files and versions
Metrics
Training metrics
Community
1
Use this model
main
llama-2-13b-reward-oasst1
Commit History
Update README.md
5e99797
vincentmin
commited on
Aug 3, 2023
Update README.md
9d26c10
vincentmin
commited on
Jul 27, 2023
Update README.md
4f94bf4
vincentmin
commited on
Jul 27, 2023
Update README.md
37a71b7
vincentmin
commited on
Jul 27, 2023
update model card README.md
f0d55ff
vincentmin
commited on
Jul 27, 2023
End of training
e2fc4dc
vincentmin
commited on
Jul 27, 2023
update model card README.md
d5ec288
vincentmin
commited on
Jul 27, 2023
End of training
5d0d1e5
vincentmin
commited on
Jul 27, 2023
Training in progress, step 3000
9a34f1c
vincentmin
commited on
Jul 27, 2023
Training in progress, step 2500
1df8702
vincentmin
commited on
Jul 27, 2023
Training in progress, step 2000
ec04b62
vincentmin
commited on
Jul 27, 2023
Training in progress, step 1500
d83d4e2
vincentmin
commited on
Jul 27, 2023
Training in progress, step 1000
24f814f
vincentmin
commited on
Jul 27, 2023
Training in progress, step 500
40f0992
vincentmin
commited on
Jul 27, 2023
initial commit
6b81956
vincentmin
commited on
Jul 24, 2023