georgesung committed on
Commit
bbdb2ff
1 Parent(s): baf1d07

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -1,11 +1,11 @@
  ---
  license: other
  datasets:
- - ehartford/wizard_vicuna_70k_unfiltered
  ---
 
  # Overview
- Fine-tuned [Llama-2 7B](https://huggingface.co/TheBloke/Llama-2-7B-fp16) with an uncensored/unfiltered Wizard-Vicuna conversation dataset [ehartford/wizard_vicuna_70k_unfiltered](https://huggingface.co/datasets/ehartford/wizard_vicuna_70k_unfiltered).
  Used QLoRA for fine-tuning. Trained for one epoch on a 24GB GPU (NVIDIA A10G) instance, took ~19 hours to train.
 
  The version here is the fp16 HuggingFace model.
@@ -58,4 +58,4 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
  | TruthfulQA (0-shot) | 41.34 |
  | Winogrande (5-shot) | 74.11 |
  | GSM8K (5-shot) | 5.84 |
- | DROP (3-shot) | 5.69 |
 
  ---
  license: other
  datasets:
+ - georgesung/wizard_vicuna_70k_unfiltered
  ---
 
  # Overview
+ Fine-tuned [Llama-2 7B](https://huggingface.co/TheBloke/Llama-2-7B-fp16) with an uncensored/unfiltered Wizard-Vicuna conversation dataset (originally from [ehartford/wizard_vicuna_70k_unfiltered](https://huggingface.co/datasets/ehartford/wizard_vicuna_70k_unfiltered)).
  Used QLoRA for fine-tuning. Trained for one epoch on a 24GB GPU (NVIDIA A10G) instance, took ~19 hours to train.
 
  The version here is the fp16 HuggingFace model.
 
  | TruthfulQA (0-shot) | 41.34 |
  | Winogrande (5-shot) | 74.11 |
  | GSM8K (5-shot) | 5.84 |
+ | DROP (3-shot) | 5.69 |