haritzpuerto
/

LLaMA2-13B-dcot

Text Generation

Model card Files Files and versions Community

haritzpuerto commited on Jun 26

Commit

1e5afae

•

1 Parent(s): 2aa3092

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -22,7 +22,7 @@ widget:
 This is the official model from the publication "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models" (arXiv, 2024).
-> TLDR: Divergent Chain of Thought (DCoT) consists of requiring models to generate multiple CoTs before choosing an answer and adding DCoT data to instruction tuning allows models to improve performance through self-correction.
 Stay tuned for the release of the paper!

 This is the official model from the publication "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models" (arXiv, 2024).
+> TLDR: Divergent Chain of Thought (DCoT) consists of requiring models to generate multiple CoTs before choosing an answer. Adding DCoT data to instruction tuning allows models to improve performance through self-correction.
 Stay tuned for the release of the paper!