finaspirant commited on
Commit
b96c919
·
verified ·
1 Parent(s): 20f3cc4

End of training

Browse files
Files changed (2) hide show
  1. README.md +1 -3
  2. model.safetensors +1 -1
README.md CHANGED
@@ -1,7 +1,5 @@
1
  ---
2
  library_name: transformers
3
- license: mit
4
- base_model: finaspirant/HW2-supervised
5
  tags:
6
  - trl
7
  - reward-trainer
@@ -18,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  # HW2-reward
20
 
21
- This model is a fine-tuned version of [finaspirant/HW2-supervised](https://huggingface.co/finaspirant/HW2-supervised) on the piqa dataset.
22
 
23
  ## Model description
24
 
 
1
  ---
2
  library_name: transformers
 
 
3
  tags:
4
  - trl
5
  - reward-trainer
 
16
 
17
  # HW2-reward
18
 
19
+ This model was trained from scratch on the piqa dataset.
20
 
21
  ## Model description
22
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:00d3a5c3c3d2e7d8ca793a7b5252ada3e2da9cab72d6a871aca8b64e93d6d04e
3
  size 497780432
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:77d5d0d8fac5cf732a57b53f4d58b28d505324fd25a4234d5a4014b00751e9a8
3
  size 497780432