dmariko/SmolLM-1.7B-Instruct-dpo-16k
TensorBoard
Safetensors
English
llama
trl
dpo
Generated from Trainer
License: cc-by-nc-4.0
Commit History (branch: main)
Training in progress, epoch 6 (7930469, verified): dmariko committed on Sep 17, 2024
Training in progress, epoch 6 (a83ed4f, verified): dmariko committed on Sep 17, 2024
Training in progress, epoch 4 (be83051, verified): dmariko committed on Sep 17, 2024
Training in progress, epoch 4 (9e37c66, verified): dmariko committed on Sep 16, 2024
Training in progress, epoch 2 (d8da104, verified): dmariko committed on Sep 16, 2024
Training in progress, epoch 2 (83f3975, verified): dmariko committed on Sep 16, 2024
Training in progress, epoch 0 (c59c85f, verified): dmariko committed on Sep 16, 2024
Update README.md (284fcaa, verified): dmariko committed on Sep 12, 2024
Upload tokenizer (5cd1246, verified): dmariko committed on Sep 12, 2024
Upload LlamaForCausalLM (00e8ac3, verified): dmariko committed on Sep 12, 2024
SmolLM-1.7B-Instruct-dpo-16k (7511c1d, verified): dmariko committed on Sep 12, 2024
initial commit (871483c, verified): dmariko committed on Sep 12, 2024