Book (사회과학, 기술과학, 철학, 법학, 예술 등) - 5000개 qlora max_seq_length=1024 num_train_epochs=3 per_device_train_batch_size=8 gradient_accumulation_steps=32, evaluation_strategy="steps" eval_steps=2000, logging_steps=25, optim="paged_adamw_8bit", learning_rate=2e-4, lr_scheduler_type="cosine", warmup_steps=10, warmup_ratio=0.05, report_to="tensorboard", weight_decay=0.01, max_steps=-1, | Model | rouge-1 | rouge-2 | rouge-l | |-------|---------|---------|---------| | **Book** | | | | | yanolja/EEVE-Korean-Instruct-2.8B-v1.0 | 0.2095 | 0.0866 | 0.1985 | | ryanu/EEVE-10.8-BOOK-v0.1 | 0.2454 | 0.1158 | 0.2404 | | meta-llama/llama-3-8b-instruct | 0.2137 | 0.0883 | 0.2020 | | meta-llama/llama-3-70b-instruct | 0.2269 | 0.0925 | 0.2186 | | **Paper** | | | | | yanolja/EEVE-Korean-Instruct-2.8B-v1.0 | 0.1934 | 0.0829 | 0.1832 | | meta-llama/llama-3-8b-instruct | 0.2044 | 0.0868 | 0.1895 | | meta-llama/llama-3-70b-instruct | 0.1935 | 0.0783 | 0.1836 |