Update README.md
README.md
@@ -12,6 +12,8 @@ Differences in the qlora scripts:
 
 __I think there's a bug in gradient accumulation, so if you try this, maybe set gradient accumulation steps to 1__
 
+__5 epochs seemed to achieve the best results, but YMMV__
+
 Full example of tuning (used for airoboros-mpt-30b-gpt4-1.4):
 
 ```
@@ -23,11 +25,11 @@ export WANDB_PROJECT=airoboros-mpt-30b-gpt4-1.4
 python qlora.py \
     --model_name_or_path ./mpt-30b \
     --output_dir ./$WANDB_PROJECT-checkpoints \
-    --num_train_epochs
+    --num_train_epochs 5 \
     --logging_steps 1 \
     --save_strategy steps \
     --data_seed 11422 \
-    --save_steps
+    --save_steps 100 \
     --save_total_limit 3 \
     --evaluation_strategy "no" \
     --eval_dataset_size 2 \
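
For reference, here is a minimal sketch of how the tuning command reads once this change is applied, assembled only from the flags visible in the hunk above (the command in the README continues past `--eval_dataset_size`, and those later flags are omitted here). The final `--gradient_accumulation_steps 1` line is not part of this commit; it is included only to illustrate the workaround suggested in the gradient-accumulation note.

```
# Sketch only: flags not shown in the hunk above are omitted.
export WANDB_PROJECT=airoboros-mpt-30b-gpt4-1.4

python qlora.py \
    --model_name_or_path ./mpt-30b \
    --output_dir ./$WANDB_PROJECT-checkpoints \
    --num_train_epochs 5 \
    --logging_steps 1 \
    --save_strategy steps \
    --data_seed 11422 \
    --save_steps 100 \
    --save_total_limit 3 \
    --evaluation_strategy "no" \
    --eval_dataset_size 2 \
    --gradient_accumulation_steps 1  # workaround for the suspected grad-accum bug (assumption; not in this commit)
```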