Update README.md
README.md
CHANGED
@@ -85,7 +85,7 @@ Notes from previous model cards:
Note that `checkpoint_0` is the base model and `checkpoint_mistral` is OpenMath-Mistral-7B-v0.1-hf. Also note that after 800 steps the fine tuned model had better coverage than the much larger teacher model.
-The zero shot performance is _not good_™, but this model could be used to quickly generate synthetic data since the coverage is decent. The uploaded model is checkpoint-2.6k.
+The zero shot performance is _not good_™, but this model could be used to quickly generate synthetic data since the coverage is decent. The uploaded model is checkpoint-2.6k (best zero-shot performance and top 4 coverage).