Upload README.md
Browse files
README.md
CHANGED
@@ -23,8 +23,6 @@ datasets:
|
|
23 |
|
24 |
Mixtress 135M is a transformer model based upon the [Mixtral](https://huggingface.co/docs/transformers/en/model_doc/mixtral) architecture. It is the culmination of approximately 20 weeks of [Kaggle](https://kaggle.com) free hours, and 67 twelve-hour training runs.
|
25 |
|
26 |
-
The results are laughably bad. The model has massively overfit to the training data, and it saw far less tokens than other models of comparable size. But at least I can say we saw it through to completion!
|
27 |
-
|
28 |
## Training data
|
29 |
|
30 |
Mixtress was trained on a curated sampling of data from the following datasets:
|
@@ -68,10 +66,12 @@ All evaluations were done using the [Pythia evaluation harness](https://github.c
|
|
68 |
|
69 |
### Scores
|
70 |
|
71 |
-
| Model and Size
|
72 |
-
|
|
73 |
-
| gpt-neo-125m
|
74 |
-
|
|
|
|
|
|
75 |
|
76 |
## Join Us
|
77 |
|
|
|
23 |
|
24 |
Mixtress 135M is a transformer model based upon the [Mixtral](https://huggingface.co/docs/transformers/en/model_doc/mixtral) architecture. It is the culmination of approximately 20 weeks of [Kaggle](https://kaggle.com) free hours, and 67 twelve-hour training runs.
|
25 |
|
|
|
|
|
26 |
## Training data
|
27 |
|
28 |
Mixtress was trained on a curated sampling of data from the following datasets:
|
|
|
66 |
|
67 |
### Scores
|
68 |
|
69 |
+
| Model and Size | ARC-easy | ARC-challenge | HellaSwag | OpenBookQA | PiQA | TinyMMLU | TriviaQA | Winogrande |
|
70 |
+
| ------------------------- | ---------- | ------------- | ---------- | ---------- | ---------- | ---------- | -------- | ---------- |
|
71 |
+
| EleutherAI/gpt-neo-125m | 22.95% | N/A | 30.26% | N/A | N/A | N/A | N/A | N/A |
|
72 |
+
| HuggingFaceTB/SmolLM-135M | 43.99% | N/A | 42.30% | N/A | 69.60% | 30.23% | 4.11% | 52.70% |
|
73 |
+
| OpenAI/GPT2-137M | 31.09% | N/A | 29.76% | N/A | 62.51% | 26.29% | 0.49% | 49.72% |
|
74 |
+
| **UNSAFE/Mixtress-135M** | **29.21%** | **24.57%** | **26.99%** | **21.80** | **52.67%** | **31.71%** | **N/A** | **50.91%** |
|
75 |
|
76 |
## Join Us
|
77 |
|