Solshine
/

reflection-llama-3.1-8B-Solshine-trainround1-16bit

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Solshine commited on Sep 8

Commit

01eb803

•

1 Parent(s): 9a04e13

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -22,6 +22,7 @@ datasets:
 - **Finetuned from model :** unsloth/meta-llama-3.1-8b-instruct-bnb-4bit
 Inspired by and featuring the Reflection Tuning technique pioneered by Matt Shumer (possibly earlier innovated by the team at Anthropic.)
 **As per the inspiring model "mattshumer/Reflection-Llama-3.1-70B" (this mode was not used in the training process nor as a foundational model, but only served as inspiration) :**
 '''

 - **Finetuned from model :** unsloth/meta-llama-3.1-8b-instruct-bnb-4bit
 Inspired by and featuring the Reflection Tuning technique pioneered by Matt Shumer (possibly earlier innovated by the team at Anthropic.)
+*To the authors' knowledge, this is the first "reflection tuned" Llama 3.1 8B LLM*
 **As per the inspiring model "mattshumer/Reflection-Llama-3.1-70B" (this mode was not used in the training process nor as a foundational model, but only served as inspiration) :**
 '''