Spaces:

PEFT
/

causal-language-modeling

Running

stevhliu HF Staff

Jainish commited on 3 days ago

Commit

263af0d

verified ·

1 Parent(s): 617c1d0

Update lora_clm_with_additional_tokens.ipynb (#7)

- Update lora_clm_with_additional_tokens.ipynb (fedc56bcbdf2e1afa6750f530fb7f899a1dba71d)

Co-authored-by: Jainish Patel <[email protected]>

Files changed (1) hide show

lora_clm_with_additional_tokens.ipynb CHANGED Viewed

@@ -9,7 +9,7 @@
     "\n",
     "In this example, we will learn how to train a LoRA model when adding new tokens to the tokenizer and model. \n",
     "This is a common usecase when doing the following:\n",
-    "1. Instruction finetuning with new tokens beind added such as `<|user|>`, `<|assistant|>`, `<|system|>`, `</s>`, `<s>` to properly format the conversations\n",
     "2. Finetuning on a specific language wherein language specific tokens are added, e.g., korean tokens being added to vocabulary for finetuning LLM on Korean datasets.\n",
     "3. Instruction finetuning to return outputs in certain format to enable agent behaviour new tokens such as `<|FUNCTIONS|>`, `<|BROWSE|>`, `<|TEXT2IMAGE|>`, `<|ASR|>`, `<|TTS|>`, `<|GENERATECODE|>`, `<|RAG|>`.\n",
     "\n",

     "\n",
     "In this example, we will learn how to train a LoRA model when adding new tokens to the tokenizer and model. \n",
     "This is a common usecase when doing the following:\n",
+    "1. Instruction finetuning with new tokens being added such as `<|user|>`, `<|assistant|>`, `<|system|>`, `</s>`, `<s>` to properly format the conversations\n",
     "2. Finetuning on a specific language wherein language specific tokens are added, e.g., korean tokens being added to vocabulary for finetuning LLM on Korean datasets.\n",
     "3. Instruction finetuning to return outputs in certain format to enable agent behaviour new tokens such as `<|FUNCTIONS|>`, `<|BROWSE|>`, `<|TEXT2IMAGE|>`, `<|ASR|>`, `<|TTS|>`, `<|GENERATECODE|>`, `<|RAG|>`.\n",
     "\n",