chrisociepa commited on
Commit
004188b
1 Parent(s): 9014f5a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -0
README.md CHANGED
@@ -1,3 +1,28 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ language:
4
+ - pl
5
+ tags:
6
+ - qlora
7
+ - instruct
8
  ---
9
+
10
+ This repo contains a qlora adapter for [Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1), fine-tuned on a private instructions dataset, exclusively in Polish.
11
+
12
+ The fine-tuning took 1 hour on a single RTX 4090 with the following hyperparameters:
13
+
14
+ * context length: 2048
15
+ * batch_size: 16
16
+ * learning_rate: 0.0001
17
+ * lora_r: 64
18
+ * lora_alpha: 16
19
+ * lora_modules: all
20
+ * lora_dropout: 0.0
21
+ * weight_decay: 0.1
22
+ * max_grad_norm: 0.3
23
+ * double_quant, nf4
24
+ * optimizer: paged_adamw_32bit (beta2: 0.999)
25
+
26
+ During the training of the adapter, the instruction format was maintained. Each prompt should be surrounded by `[INST]` and `[\INST]` tokens to leverage instruction fine-tuning.
27
+
28
+ This adapter allows the model to speak Polish more accurately than vanilla [Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1).