georgesung commited on
Commit
dd775f6
1 Parent(s): f7bf90b

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +36 -0
README.md ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ datasets:
4
+ - georgesung/OpenOrca_35k
5
+ ---
6
+
7
+ # Overview
8
+ Fine-tuned [Llama-2 7B](https://huggingface.co/TheBloke/Llama-2-7B-fp16) with a 35k subset of the OpenOrca dataset [georgesung/OpenOrca_35k](https://huggingface.co/datasets/georgesung/OpenOrca_35k).
9
+ Used QLoRA for fine-tuning. Trained for one epoch on a 24GB GPU (NVIDIA A10G) instance.
10
+
11
+ # Prompt style
12
+ The model was trained with the following prompt style:
13
+ ```
14
+ ### System:
15
+ You are a helpful AI assistant.
16
+
17
+ ### Instruction:
18
+ Hello
19
+
20
+ ### Response:
21
+ Hi, how can I help you?
22
+ ```
23
+
24
+ # Training code
25
+ Code used to train the model is available [here](https://github.com/georgesung/llm_qlora).
26
+
27
+ To reproduce the results:
28
+ ```
29
+ git clone https://github.com/georgesung/llm_qlora
30
+ cd llm_qlora
31
+ pip install -r requirements.txt
32
+ python train.py configs/llama2_7b_orca_35k.yaml
33
+ ```
34
+
35
+ # Fine-tuning guide
36
+ https://georgesung.github.io/ai/qlora-ift/