euclaise
/

Memphis-CoT-3B

Text Generation

supertrainer2000

Model card Files Files and versions Community

euclaise commited on Jan 30, 2024

Commit

a6cb189

·

verified ·

1 Parent(s): 432a208

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -70,7 +70,7 @@ The format for TinyCoT was:
 Memphis outperforms human-data models that are over twice its size, along with SFT models of its size, and trades with the Zephyr DPO model. That said, Zephyr uses synthetic data, and *much* more of it.
-Note that BBH results have wide SEs, exceeding 16%.
 It is unclear why Zephyr performs so poorly on BBH. Perhaps it is overfit, or maybe there was an issue with vllm.

 Memphis outperforms human-data models that are over twice its size, along with SFT models of its size, and trades with the Zephyr DPO model. That said, Zephyr uses synthetic data, and *much* more of it.
+Note that BBH results have wide SEs, sometimes even exceeding 16%.
 It is unclear why Zephyr performs so poorly on BBH. Perhaps it is overfit, or maybe there was an issue with vllm.