theuerc
/

TinyLlama-1.1B-Chat-Math-v1.0

Text Generation

text-generation-inference

Model card Files Files and versions Community

theuerc commited on May 5, 2024

Commit

c7c09b6

·

verified ·

1 Parent(s): 14aa14b

Update README.md

Files changed (1) hide show

README.md +23 -0

README.md CHANGED Viewed

@@ -68,6 +68,29 @@ Note that `checkpoint_0` is the base model and `checkpoint_mistral` is OpenMath-
 The performance is _not good_™, but this model could be used to quickly generate synthetic data, as the coverage is decent for this dataset. The uploaded model is checkpoint-2.6k.
 People involved in creating this fine tune:
 - Coulton Theuer [[email protected]]
 - Bret Ellenbogen [[email protected]]

 The performance is _not good_™, but this model could be used to quickly generate synthetic data, as the coverage is decent for this dataset. The uploaded model is checkpoint-2.6k.
+| Checkpoint | Coverage  |
+|------------|-----------|
+| 1600       | 0.890244  |
+| 2200       | 0.890244  |
+| 2400       | 0.890244  |
+| **2600**       | 0.878049  |
+| 1200       | 0.878049  |
+| 2800       | 0.853659  |
+| 2000       | 0.853659  |
+| 800        | 0.841463  |
+| 1000       | 0.829268  |
+| 1800       | 0.829268  |
+| 1400       | 0.817073  |
+| mistral    | 0.804878  |
+| 3000       | 0.780488  |
+| 600        | 0.768293  |
+| 400        | 0.731707  |
+| 200        | 0.682927  |
+| 0          | 0.000000  |
+Note that after 800 steps the fine tuned model had better coverage than the much larger teacher model.
 People involved in creating this fine tune:
 - Coulton Theuer [[email protected]]
 - Bret Ellenbogen [[email protected]]