---
pipeline_tag: text-generation
license: apache-2.0
language:
- en
tags:
- SOLAR-10.7B-v1.0
- Open-platypus-Commercial
base_model: upstage/SOLAR-10.7B-v1.0
datasets:
- kyujinpy/Open-platypus-Commercial
model-index:
- name: T3Q-platypus-SOLAR-10.7B-v1.0
  results: []
---
Update @ 2024.03.07

## T3Q-platypus-SOLAR-10.7B-v1.0

This model is a fine-tuned version of upstage/SOLAR-10.7B-v1.0, trained on the kyujinpy/Open-platypus-Commercial dataset.

**Model Developers** Chihoon Lee (chlee10), T3Q

## Training hyperparameters

The following hyperparameters were used during training:

```python
# Hyperparameters for the dataset and number of training passes
batch_size = 16
num_epochs = 1
micro_batch = 1
gradient_accumulation_steps = batch_size // micro_batch

# Hyperparameters for the training procedure
cutoff_len = 4096
lr_scheduler = 'cosine'
warmup_ratio = 0.06  # warmup_steps = 100
learning_rate = 4e-4
optimizer = 'adamw_torch'
weight_decay = 0.01
max_grad_norm = 1.0

# LoRA config
lora_r = 16
lora_alpha = 16
lora_dropout = 0.05
lora_target_modules = ["gate_proj", "down_proj", "up_proj"]

# Options for how tokenizer inputs are handled
train_on_inputs = False
add_eos_token = False

# NEFTune params
noise_alpha = 5
```
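As a minimal sketch (not the authors' training script), the hyperparameters above can be collected into a single config dict, with the derived values the card computes (e.g. `gradient_accumulation_steps = batch_size // micro_batch`) made explicit:

```python
# Sketch only: assembles the hyperparameters listed above into one dict.
# Key names are illustrative, not taken from the authors' code.
batch_size = 16
micro_batch = 1

train_config = {
    "num_epochs": 1,
    "micro_batch": micro_batch,
    # Derived as in the card: per-step accumulation to reach batch_size
    "gradient_accumulation_steps": batch_size // micro_batch,
    "cutoff_len": 4096,
    "lr_scheduler": "cosine",
    "warmup_ratio": 0.06,
    "learning_rate": 4e-4,
    "optimizer": "adamw_torch",
    "weight_decay": 0.01,
    "max_grad_norm": 1.0,
    "lora": {
        "r": 16,
        "alpha": 16,
        "dropout": 0.05,
        "target_modules": ["gate_proj", "down_proj", "up_proj"],
    },
    "train_on_inputs": False,
    "add_eos_token": False,
    "neftune_noise_alpha": 5,
}

# With micro_batch = 1, each optimizer step accumulates 16 micro-batches,
# giving an effective batch size of 16.
effective_batch = train_config["micro_batch"] * train_config["gradient_accumulation_steps"]
print(effective_batch)  # 16
```

Note that `lora_alpha` equals `lora_r` here, so the LoRA scaling factor `alpha / r` is 1.0.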