Update README.md
Browse files
README.md
CHANGED
@@ -1,5 +1,21 @@
Previous README content (before this commit):

1 epoch of grimulkan/LimaRP-augmented on LLaMA-8b via unsloth on colab, using the llama-chat template.

```
    model = model,
    tokenizer = tokenizer,
    train_dataset = dataset,
```
|
Updated README content (after this commit):

1 epoch of grimulkan/LimaRP-augmented on LLaMA-8b via unsloth on colab, using the llama-chat template.

```
model = FastLanguageModel.get_peft_model(
    model,
    r = 64, # Choose any number > 0 ! Suggested 8, 16, 32, 64, 128
    target_modules = ["q_proj", "k_proj", "v_proj", "o_proj",
                      "gate_proj", "up_proj", "down_proj",],
    lora_alpha = 16,
    lora_dropout = 0, # Supports any, but = 0 is optimized
    bias = "none", # Supports any, but = "none" is optimized
    # [NEW] "unsloth" uses 30% less VRAM, fits 2x larger batch sizes!
    use_gradient_checkpointing = "unsloth", # True or "unsloth" for very long context
    random_state = 3407,
    use_rslora = True, # We support rank stabilized LoRA
    loftq_config = None, # And LoftQ
)

trainer = SFTTrainer(
    model = model,
    tokenizer = tokenizer,
    train_dataset = dataset,
```