lsmille
/

lora_evo_ta_all_layers_8

Generated from Trainer

Model card Files Files and versions Community

lsmille commited on May 28, 2024

Commit

7b89698

·

verified ·

1 Parent(s): eac323f

Update README.md

Files changed (1) hide show

README.md +20 -2

README.md CHANGED Viewed

@@ -20,7 +20,25 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
 ## Intended uses & limitations
@@ -28,7 +46,7 @@ More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure

 ## Model description
+*BEST MODEL*
+lora_alpha = 32
+lora_dropout = 0.05
+lora_r = 16
+epochs = 3
+learning rate = 3e-4
+warmup_steps=0.5
+gradient_accumulation_steps = 1 <---- virtual batch of 1 (update every sample)
+train_batch = 1
+eval_batch = 1
 ## Intended uses & limitations
 ## Training and evaluation data
+in files
 ## Training procedure