Update README.md
README.md
CHANGED
@@ -68,7 +68,7 @@ The format for TinyCoT was:
 | [**Memphis-CoT 3B**](https://hf.co/euclaise/memphis-cot-3b) | 3B | **Human**+OASST | Self-teaching | **13.8%** | *26.24%* | **38.24%** |
 *5-shot, as performed automatically by LM Evaluation Harness bbh_cot_fewshot even with num_fewshot=0
 
-Memphis outperforms human-data models that are over twice its size, along with SFT models of its size, and trades with the Zephyr DPO model. That said, Zephyr uses synthetic data, and *much* more of it.
+Memphis outperforms other primarily-human-data models that are over twice its size, along with SFT models of its size, and trades with the Zephyr DPO model. That said, Zephyr uses synthetic data, and *much* more of it.
 
 Note that BBH results have wide SEs, sometimes even exceeding 16%.
 