georgesung
/

llama2_7b_chat_uncensored

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

georgesung commited on Mar 7, 2024

Commit

bbdb2ff

·

verified ·

1 Parent(s): baf1d07

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -1,11 +1,11 @@
 ---
 license: other
 datasets:
-- ehartford/wizard_vicuna_70k_unfiltered
 ---
 # Overview
-Fine-tuned [Llama-2 7B](https://huggingface.co/TheBloke/Llama-2-7B-fp16) with an uncensored/unfiltered Wizard-Vicuna conversation dataset [ehartford/wizard_vicuna_70k_unfiltered](https://huggingface.co/datasets/ehartford/wizard_vicuna_70k_unfiltered).
 Used QLoRA for fine-tuning. Trained for one epoch on a 24GB GPU (NVIDIA A10G) instance, took ~19 hours to train.
 The version here is the fp16 HuggingFace model.
@@ -58,4 +58,4 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
 | TruthfulQA (0-shot)   | 41.34   |
 | Winogrande (5-shot)   | 74.11   |
 | GSM8K (5-shot)        | 5.84        |
-| DROP (3-shot)         | 5.69         |

 ---
 license: other
 datasets:
+- georgesung/wizard_vicuna_70k_unfiltered
 ---
 # Overview
+Fine-tuned [Llama-2 7B](https://huggingface.co/TheBloke/Llama-2-7B-fp16) with an uncensored/unfiltered Wizard-Vicuna conversation dataset (originally from [ehartford/wizard_vicuna_70k_unfiltered](https://huggingface.co/datasets/ehartford/wizard_vicuna_70k_unfiltered)).
 Used QLoRA for fine-tuning. Trained for one epoch on a 24GB GPU (NVIDIA A10G) instance, took ~19 hours to train.
 The version here is the fp16 HuggingFace model.
 | TruthfulQA (0-shot)   | 41.34   |
 | Winogrande (5-shot)   | 74.11   |
 | GSM8K (5-shot)        | 5.84        |
+| DROP (3-shot)         | 5.69         |