finaspirant/HW2-reward

Tags: Text Classification, Generated from Trainer, text-generation-inference, Inference Endpoints

finaspirant committed on Oct 8, 2024

Commit b96c919 · verified · 1 parent: 20f3cc4

End of training

Files changed (2)

README.md +1 -3
model.safetensors +1 -1

README.md CHANGED

@@ -1,7 +1,5 @@
 ---
 library_name: transformers
-license: mit
-base_model: finaspirant/HW2-supervised
 tags:
 - trl
 - reward-trainer
@@ -18,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 # HW2-reward
-This model is a fine-tuned version of [finaspirant/HW2-supervised](https://huggingface.co/finaspirant/HW2-supervised) on the piqa dataset.
+This model was trained from scratch on the piqa dataset.
 ## Model description

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:00d3a5c3c3d2e7d8ca793a7b5252ada3e2da9cab72d6a871aca8b64e93d6d04e
+oid sha256:77d5d0d8fac5cf732a57b53f4d58b28d505324fd25a4234d5a4014b00751e9a8
 size 497780432
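
The model.safetensors change above touches only a Git LFS pointer: the oid changes while the size stays at 497780432 bytes, so the weights were replaced by a different file of identical length. As a minimal sketch (not part of this commit), the Python below parses such a pointer and checks a locally downloaded file against it. The helper names `parse_lfs_pointer` and `matches_pointer` and the constant `NEW_POINTER` are hypothetical and introduced only for illustration; the pointer text is copied from the diff.

```python
import hashlib
from pathlib import Path

# Pointer contents after this commit, copied from the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:77d5d0d8fac5cf732a57b53f4d58b28d505324fd25a4234d5a4014b00751e9a8
size 497780432
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form '<hash-algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    p = Path(path)
    if p.stat().st_size != pointer["size"]:
        return False  # cheap check first: avoids hashing a file of the wrong length
    h = hashlib.new(pointer["algo"])
    with p.open("rb") as f:
        # Hash in 1 MiB chunks so a ~500 MB file is never fully loaded into memory.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]
```

Assuming the weights have been downloaded to a local path, `matches_pointer("model.safetensors", parse_lfs_pointer(NEW_POINTER))` would report whether that copy corresponds to this commit rather than its parent.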