hermeschen1116 committed f86dbf9 (parent: 2068ebb): Update README.md
README.md
CHANGED
```diff
@@ -7,7 +7,6 @@ tags:
 - trl
 - sft
 - unsloth
-- generated_from_trainer
 model-index:
 - name: response_generator_for_emotion_chat_bot
   results: []
@@ -19,7 +18,7 @@ pipeline_tag: text-generation
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 
-#
+# Response Generator for [Emotion Chat Bot](https://github.com/hermeschen1116/chat-bot)
 
 This model is a fine-tuned version of [unsloth/llama-2-7b-bnb-4bit](https://huggingface.co/unsloth/llama-2-7b-bnb-4bit) on [hermeschen1116/daily_dialog_for_RG](https://huggingface.co/datasets/hermeschen1116/daily_dialog_for_RG), a self-modified version of [daily_dialog](li2017dailydialog/daily_dialog).
 
@@ -40,7 +39,12 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
+- system_prompt: ""
 - learning_rate: 0.0002
+- weight_decay: 0.001
+- max_grad_norm: 0.3
+- warmup_ratio: 0.03
+- max_steps: -1
 - train_batch_size: 4
 - eval_batch_size: 8
 - seed: 42
@@ -48,6 +52,11 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_ratio: 0.03
 - num_epochs: 1
+- init_lora_weights: true
+- lora_rank: 16
+- lora_alpha: 16
+- lora_dropout: 0.1
+- use_rslora: true
 
 ### Framework versions
```
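The hyperparameters this commit adds to the README can be mapped onto configuration objects roughly as follows. This is a minimal sketch, assuming the `peft` and `transformers` libraries; the commit does not include the training script, so the argument names below (e.g. `output_dir`, `task_type`, and `r` as peft's name for `lora_rank`) are assumptions, not the author's actual code:

```python
# Sketch only: maps the README's hyperparameter list onto peft/transformers
# config objects. The actual training script is not part of this commit.
from peft import LoraConfig
from transformers import TrainingArguments

# LoRA settings added in this commit (peft calls lora_rank "r")
lora_config = LoraConfig(
    r=16,                    # lora_rank: 16
    lora_alpha=16,           # lora_alpha: 16
    lora_dropout=0.1,        # lora_dropout: 0.1
    use_rslora=True,         # use_rslora: true (rank-stabilized LoRA scaling)
    init_lora_weights=True,  # init_lora_weights: true
    task_type="CAUSAL_LM",   # assumed: text-generation pipeline tag
)

# Optimizer and schedule settings from the hyperparameter list
training_args = TrainingArguments(
    output_dir="response_generator_for_emotion_chat_bot",  # assumed name
    learning_rate=2e-4,             # learning_rate: 0.0002
    weight_decay=0.001,
    max_grad_norm=0.3,
    warmup_ratio=0.03,              # lr_scheduler_warmup_ratio: 0.03
    max_steps=-1,                   # -1 => train for num_train_epochs instead
    per_device_train_batch_size=4,  # train_batch_size: 4
    per_device_eval_batch_size=8,   # eval_batch_size: 8
    seed=42,
    lr_scheduler_type="constant",
    num_epochs := 1 if False else None,  # placeholder removed below
)
```

Since the base model is an unsloth 4-bit checkpoint, the author likely attached the adapter via `FastLanguageModel.get_peft_model` rather than `peft` directly; the parameter values are the same either way.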