georgesung
commited on
Commit
•
bbdb2ff
1
Parent(s):
baf1d07
Update README.md
Browse files
README.md
CHANGED
@@ -1,11 +1,11 @@
|
|
1 |
---
|
2 |
license: other
|
3 |
datasets:
|
4 |
-
-
|
5 |
---
|
6 |
|
7 |
# Overview
|
8 |
-
Fine-tuned [Llama-2 7B](https://huggingface.co/TheBloke/Llama-2-7B-fp16) with an uncensored/unfiltered Wizard-Vicuna conversation dataset [ehartford/wizard_vicuna_70k_unfiltered](https://huggingface.co/datasets/ehartford/wizard_vicuna_70k_unfiltered).
|
9 |
Used QLoRA for fine-tuning. Trained for one epoch on a 24GB GPU (NVIDIA A10G) instance, took ~19 hours to train.
|
10 |
|
11 |
The version here is the fp16 HuggingFace model.
|
@@ -58,4 +58,4 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
|
|
58 |
| TruthfulQA (0-shot) | 41.34 |
|
59 |
| Winogrande (5-shot) | 74.11 |
|
60 |
| GSM8K (5-shot) | 5.84 |
|
61 |
-
| DROP (3-shot) | 5.69 |
|
|
|
1 |
---
|
2 |
license: other
|
3 |
datasets:
|
4 |
+
- georgesung/wizard_vicuna_70k_unfiltered
|
5 |
---
|
6 |
|
7 |
# Overview
|
8 |
+
Fine-tuned [Llama-2 7B](https://huggingface.co/TheBloke/Llama-2-7B-fp16) with an uncensored/unfiltered Wizard-Vicuna conversation dataset (originally from [ehartford/wizard_vicuna_70k_unfiltered](https://huggingface.co/datasets/ehartford/wizard_vicuna_70k_unfiltered)).
|
9 |
Used QLoRA for fine-tuning. Trained for one epoch on a 24GB GPU (NVIDIA A10G) instance, took ~19 hours to train.
|
10 |
|
11 |
The version here is the fp16 HuggingFace model.
|
|
|
58 |
| TruthfulQA (0-shot) | 41.34 |
|
59 |
| Winogrande (5-shot) | 74.11 |
|
60 |
| GSM8K (5-shot) | 5.84 |
|
61 |
+
| DROP (3-shot) | 5.69 |
|