Solshine/reflection-llama-3.1-8B-Solshine-trainround1-16bit

Solshine committed on Sep 8

Commit 6130880 • 1 Parent(s): 367433c

Update README.md

Files changed (1)

README.md +4 -1

README.md CHANGED

@@ -10,6 +10,7 @@ tags:
 - llama
 - trl
 - sft
+- reflection
 ---
 # Uploaded  model
@@ -18,6 +19,8 @@ tags:
 - **License:** apache-2.0
 - **Finetuned from model :** unsloth/meta-llama-3.1-8b-instruct-bnb-4bit
+Inspired by and featuring the Reflection Tuning technique pioneered by Matt Shumer (possibly earlier innovated by the team at Anthropic.)
 This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
+[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)