Update README.md
README.md
@@ -63,11 +63,6 @@ Notes from previous model cards:
 
 
 
-Note that `checkpoint_0` is the base model and `checkpoint_mistral` is OpenMath-Mistral-7B-v0.1-hf.
-
-The performance is _not good_™, but this model could be used to quickly generate synthetic data, as the coverage is decent for this dataset. The uploaded model is checkpoint-2.6k.
-
-
 | Checkpoint | Coverage |
 |------------|-----------|
 | 1600 | 0.890244 |
@@ -88,7 +83,10 @@ The performance is _not good_™, but this model could be used to quickly genera
 | 200 | 0.682927 |
 | 0 | 0.000000 |
 
-Note that after 800 steps the fine tuned model had better coverage than the much larger teacher model.
+Note that `checkpoint_0` is the base model and `checkpoint_mistral` is OpenMath-Mistral-7B-v0.1-hf. Also note that after 800 steps the fine tuned model had better coverage than the much larger teacher model.
+
+The performance is _not good_™, but this model could be used to quickly generate synthetic data, as the coverage is decent for this dataset. The uploaded model is checkpoint-2.6k.
+
 
 People involved in creating this fine tune:
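
The card does not define "Coverage" in the table above. In OpenMath-style evaluations it typically means the fraction of problems for which at least one of several sampled solutions is correct (a pass@k-style number); the sketch below assumes that definition, and the `results` structure in it is purely illustrative.

```python
# A minimal sketch of coverage as "fraction of problems with at least one
# correct sampled solution" -- an assumed definition, not confirmed by the card.
def coverage(results: dict[str, list[bool]]) -> float:
    """`results` maps a problem id to correctness flags, one per sample."""
    solved = sum(1 for flags in results.values() if any(flags))
    return solved / len(results)

# Two of three problems have at least one correct sample -> ~0.667.
print(coverage({"p1": [False, True], "p2": [False, False], "p3": [True, True]}))
```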
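
To use the checkpoint for quick synthetic-data generation as the card suggests, a plain `transformers` sampling loop is enough. This is a minimal sketch: the repo id and the prompt format are placeholders, since the card specifies neither.

```python
# Minimal sampling sketch with Hugging Face transformers. The repo id below is
# a placeholder (assumption); substitute the actual model repository.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your-username/openmath-finetune-checkpoint-2.6k"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

def sample_solutions(problem: str, k: int = 4) -> list[str]:
    # Sampling (not greedy decoding) is what makes coverage meaningful:
    # any one of the k candidates may turn out to be correct.
    inputs = tokenizer(problem, return_tensors="pt").to(model.device)
    outputs = model.generate(
        **inputs,
        do_sample=True,
        temperature=0.7,
        max_new_tokens=512,
        num_return_sequences=k,
        pad_token_id=tokenizer.eos_token_id,
    )
    prompt_len = inputs["input_ids"].shape[1]
    return [
        tokenizer.decode(o[prompt_len:], skip_special_tokens=True)
        for o in outputs
    ]
```

Candidates can then be filtered (for example, by checking the final answer against a reference) before being kept as synthetic training data.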