Book (social science, technology, philosophy, law, arts, etc.) - 5,000 samples
QLoRA fine-tuning with the following settings:
max_seq_length=1024
num_train_epochs=3
per_device_train_batch_size=8
gradient_accumulation_steps=32
evaluation_strategy="steps"
eval_steps=2000
logging_steps=25
optim="paged_adamw_8bit"
learning_rate=2e-4
lr_scheduler_type="cosine"
warmup_steps=10
warmup_ratio=0.05
report_to="tensorboard"
weight_decay=0.01
max_steps=-1
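
To make the interaction between these settings concrete, the following is a rough sketch of the step arithmetic they imply. It assumes that all 5,000 Book samples are used for training and that training runs on a single device; neither is stated in this README, so treat the numbers as estimates. The exact step count also depends on how the training library handles the final partial accumulation window.

```python
import math

# Values taken from the settings listed above.
per_device_train_batch_size = 8
gradient_accumulation_steps = 32
num_train_epochs = 3
eval_steps = 2000
warmup_steps = 10

# Assumptions (not stated in the README).
num_examples = 5000   # assumed: every Book sample is a training example
num_devices = 1       # assumed: single-GPU training

# Number of examples consumed per optimizer update.
effective_batch = per_device_train_batch_size * gradient_accumulation_steps * num_devices

# Approximate optimizer updates per epoch and over the whole run.
steps_per_epoch = math.ceil(num_examples / effective_batch)
total_steps = steps_per_epoch * num_train_epochs

# How many step-based evaluations fit into the run.
evals_during_training = total_steps // eval_steps

# Share of the run spent in warmup when warmup_steps is used.
warmup_fraction = warmup_steps / total_steps

print(f"effective batch size: {effective_batch}")
print(f"approx. optimizer steps: {total_steps}")
print(f"step-based evaluations: {evals_during_training}")
print(f"warmup share of training: {warmup_fraction:.1%}")
```

Under these assumptions the run has only a few dozen optimizer steps, so `eval_steps=2000` would never trigger and `logging_steps=25` would log only a couple of times. The parameter names match Hugging Face `TrainingArguments`; if that is what was used, a positive `warmup_steps` has historically taken precedence over `warmup_ratio`, so the `warmup_ratio=0.05` setting would likely have had no effect.
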
ROUGE scores (higher is better):

| Model | ROUGE-1 | ROUGE-2 | ROUGE-L |
|-------|---------|---------|---------|
| **Book** | | | |
| yanolja/EEVE-Korean-Instruct-2.8B-v1.0 | 0.2095 | 0.0866 | 0.1985 |
| ryanu/EEVE-10.8-BOOK-v0.1 | 0.2454 | 0.1158 | 0.2404 |
| meta-llama/llama-3-8b-instruct | 0.2137 | 0.0883 | 0.2020 |
| meta-llama/llama-3-70b-instruct | 0.2269 | 0.0925 | 0.2186 |
| **Paper** | | | |
| yanolja/EEVE-Korean-Instruct-2.8B-v1.0 | 0.1934 | 0.0829 | 0.1832 |
| meta-llama/llama-3-8b-instruct | 0.2044 | 0.0868 | 0.1895 |
| meta-llama/llama-3-70b-instruct | 0.1935 | 0.0783 | 0.1836 |
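
For readers unfamiliar with the metrics in the table, here is a minimal sketch of how ROUGE-1, ROUGE-2, and ROUGE-L F1 scores can be computed. This README does not say which ROUGE implementation or tokenizer produced the numbers above, so this is an illustration only: it assumes whitespace tokenization (Korean text is often scored on morphemes or characters instead), and the helper names are hypothetical.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def f1(overlap, ref_total, cand_total):
    """F1 score from an overlap count and the reference/candidate totals."""
    if overlap == 0 or ref_total == 0 or cand_total == 0:
        return 0.0
    recall = overlap / ref_total
    precision = overlap / cand_total
    return 2 * precision * recall / (precision + recall)


def rouge_n(reference, candidate, n):
    """ROUGE-N F1 using clipped n-gram overlap on whitespace tokens."""
    ref = ngrams(reference.split(), n)
    cand = ngrams(candidate.split(), n)
    overlap = sum((ref & cand).values())  # Counter & keeps the minimum counts
    return f1(overlap, sum(ref.values()), sum(cand.values()))


def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(reference, candidate):
    """ROUGE-L F1 based on the longest common subsequence."""
    ref, cand = reference.split(), candidate.split()
    return f1(lcs_length(ref, cand), len(ref), len(cand))


reference = "the cat sat on the mat"
candidate = "the cat lay on the mat"
r1 = rouge_n(reference, candidate, 1)
r2 = rouge_n(reference, candidate, 2)
rl = rouge_l(reference, candidate)
print(f"ROUGE-1={r1:.4f} ROUGE-2={r2:.4f} ROUGE-L={rl:.4f}")
```

Scores from a sketch like this are not directly comparable to the table unless the tokenization and aggregation match whatever the original evaluation used.
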