akumaburn committed
Commit e6ac807 · verified · 1 Parent(s): 971f8b5

Update README.md

Files changed (1):
  1. README.md +54 -2
README.md CHANGED
@@ -7,15 +7,67 @@ tags:
  - transformers
  - unsloth
  - llama
- - gguf
  base_model: unsloth/llama-3-8b-bnb-4bit
  ---
 
- # Uploaded model
 
  - **Developed by:** akumaburn
  - **License:** apache-2.0
  - **Finetuned from model:** unsloth/llama-3-8b-bnb-4bit
 
  This Llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.
 
  - transformers
  - unsloth
  - llama
+ - trl
+ - sft
  base_model: unsloth/llama-3-8b-bnb-4bit
+ datasets:
+ - yahma/alpaca-cleaned
  ---
 
+ # Alpaca-Llama-3-8B
+
+ - **Fine-tuned using dataset:** https://huggingface.co/datasets/yahma/alpaca-cleaned
+ - **Epoch Count:** 1
+ - **Step Count:** 6,470/6,470
+ - **Batch Size:** 2
+ - **Gradient Accumulation Steps:** 4
+ - **Context Size:** 8192
+ - **Num examples:** 51,760
+ - **Trainable Parameters:** 41,943,040
+ - **Learning Rate:** 0.00001
+ - **Training Loss:** 0.960000
+ - **Fine-tuned using:** Google Colab Pro (Nvidia T4 runtime)
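The reported step count is consistent with the hyperparameters above: one epoch over 51,760 examples with an effective batch of 2 × 4 = 8 yields 6,470 optimizer steps. A quick arithmetic check (plain Python, no training dependencies):

```python
import math

num_examples = 51_760  # "Num examples" above
batch_size = 2         # per-device batch size
grad_accum = 4         # gradient accumulation steps
epochs = 1

# One optimizer step consumes batch_size * grad_accum examples.
effective_batch = batch_size * grad_accum
steps_per_epoch = math.ceil(num_examples / effective_batch)
total_steps = steps_per_epoch * epochs

print(total_steps)  # 6470, matching the 6,470/6,470 step count above
```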
 
  - **Developed by:** akumaburn
  - **License:** apache-2.0
  - **Finetuned from model:** unsloth/llama-3-8b-bnb-4bit
+ - **Prompt Format:** Alpaca System (https://libertai.io/apis/text-generation/prompting.html)
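The Alpaca format referenced above wraps each example in an instruction/input/response template. A minimal sketch of a prompt builder, assuming the standard Alpaca template text (verify the exact wording against the linked prompting guide):

```python
def alpaca_prompt(instruction: str, input_text: str = "") -> str:
    """Build a prompt in the standard Alpaca instruction format."""
    if input_text:
        return (
            "Below is an instruction that describes a task, paired with an input "
            "that provides further context. Write a response that appropriately "
            "completes the request.\n\n"
            f"### Instruction:\n{instruction}\n\n"
            f"### Input:\n{input_text}\n\n"
            "### Response:\n"
        )
    # Variant without an input field.
    return (
        "Below is an instruction that describes a task. Write a response that "
        "appropriately completes the request.\n\n"
        f"### Instruction:\n{instruction}\n\n"
        "### Response:\n"
    )

prompt = alpaca_prompt("Summarize the plot of Hamlet.")
print(prompt)
```

The model's completion is whatever it generates after the trailing `### Response:` marker.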
+
+ The full model can be found at https://huggingface.co/akumaburn/Alpaca-Llama-3-8B
+
+ Benchmark results (llama.cpp, Q8_0 GGUF quantizations; the best score in each column is bold):
+
+ | Model | MMLU-Test | Arc-Easy | Truthful QA | Arc-Challenge |
+ |---|---|---|---|---|
+ | mistral-7b-openorca | **41.5836 +/- 0.4174** | 72.6316 +/- 1.8691 | **32.0685 +/- 1.6339** | 48.8294 +/- 2.8956 |
+ | llama-3-8b-bnb-4bit | 40.4074 +/- 0.4156 | 73.8596 +/- 1.8421 | 26.6830 +/- 1.5484 | 46.8227 +/- 2.8906 |
+ | Open_Orca_Llama-3-8B-unsloth | 39.3818 +/- 0.4138 | 67.3684 +/- 1.9656 | 29.0086 +/- 1.5886 | 42.1405 +/- 2.8604 |
+ | **Alpaca-Llama-3-8B-GGUF-unsloth** | 40.6441 +/- 0.4160 | **77.5439 +/- 1.7494** | 29.7430 +/- 1.6003 | **50.5017 +/- 2.8963** |
+ | Meta-Llama-3-8B | 40.8664 +/- 0.4163 | 74.3860 +/- 1.8299 | 28.6414 +/- 1.5826 | 47.1572 +/- 2.8917 |
+
+ Llama.cpp options used for testing:
+ --samplers "tfs;typical;temp" --draft 32 --ctx-size 8192 --temp 0.82 --tfs 0.8 --typical 1.1 --repeat-last-n 512 --batch-size 8192 --repeat-penalty 1.0 --n-gpu-layers 100 --threads 12
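Put together, a full llama.cpp invocation with the options above might look like the following. This is an illustrative command-line fragment: the binary name (`llama-cli` in recent builds, `main` in older ones) and the model path are assumptions — substitute your own.

```shell
# Hypothetical binary and model path; the flags are taken verbatim
# from the testing options listed above.
./llama-cli -m ./Alpaca-Llama-3-8B-GGUF-unsloth.Q8_0.gguf \
  --samplers "tfs;typical;temp" --draft 32 --ctx-size 8192 \
  --temp 0.82 --tfs 0.8 --typical 1.1 --repeat-last-n 512 \
  --batch-size 8192 --repeat-penalty 1.0 \
  --n-gpu-layers 100 --threads 12
```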
 
  This Llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.