yahma
/

alpaca-13b-lora

Model card Files Files and versions Community

yahma commited on Apr 4, 2023

Commit

175a00b

·

1 Parent(s): e54b448

Update README.md

Files changed (1) hide show

README.md +28 -0

README.md CHANGED Viewed

@@ -1,3 +1,31 @@
 ---
 license: mit
 ---

 ---
 license: mit
+datasets:
+- yahma/alpaca-cleaned
 ---
+This repo contains a low-rank adapter for LLaMA-13b fit on the Cleaned Alpaca dataset.
+This version of the weights was trained with the following hyperparameters:
+    Cleaned dataset: Snapshot April 2, 2023
+    Epochs: 3
+    Validation set size: 2000
+    Batch size: 128
+    Micro batch size: 8
+    Cutoff length: 512
+    Learning rate: 3e-4
+    Lora r: 16
+    Lora target modules: q_proj, k_proj, v_proj, o_proj
+That is:
+python finetune.py \
+    --base_model='decapoda-research/llama-13b-hf' \
+    --data_path 'yahma/alpaca-cleaned' \
+    --num_epochs=3 \
+    --cutoff_len=512 \
+    --output_dir='./lora-alpaca' \
+    --lora_target_modules='[q_proj,k_proj, v_proj, o_proj]' \
+    --lora_r=16 \
+    --val_set_size 2000 \
+    --micro_batch_size=8