|
Book (์ฌํ๊ณผํ, ๊ธฐ์ ๊ณผํ, ์ฒ ํ, ๋ฒํ, ์์ ๋ฑ) - 5000๊ฐ |
|
|
|
|
|
QLoRA fine-tuning hyperparameters (a sketch of how they fit together follows the list):
|
|
|
- `max_seq_length=1024`
- `num_train_epochs=3`
- `per_device_train_batch_size=8`
- `gradient_accumulation_steps=32`
- `evaluation_strategy="steps"`
- `eval_steps=2000`
- `logging_steps=25`
- `optim="paged_adamw_8bit"`
- `learning_rate=2e-4`
- `lr_scheduler_type="cosine"`
- `warmup_steps=10`
- `warmup_ratio=0.05`
- `report_to="tensorboard"`
- `weight_decay=0.01`
- `max_steps=-1`
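These values map one-to-one onto Hugging Face `TrainingArguments`, with `max_seq_length` consumed by TRL's `SFTTrainer`. Below is a minimal sketch of how they could be wired into a QLoRA run; the base checkpoint, LoRA settings (r, alpha, dropout), dataset files, and output path are assumptions not stated in this section, and the sketch targets the older TRL signature where `max_seq_length` and `dataset_text_field` are constructor arguments (newer TRL moves them into `SFTConfig`, and newer transformers renames `evaluation_strategy` to `eval_strategy`).

```python
# Hedged sketch: wiring the listed hyperparameters into a QLoRA run with
# transformers + peft + trl (bitsandbytes 4-bit quantization).
import torch
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          BitsAndBytesConfig, TrainingArguments)
from peft import LoraConfig
from trl import SFTTrainer

base_model_id = "yanolja/EEVE-Korean-Instruct-10.8B-v1.0"  # assumed base checkpoint

# 4-bit NF4 quantization, the standard QLoRA recipe.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_use_double_quant=True,
)

tokenizer = AutoTokenizer.from_pretrained(base_model_id)
model = AutoModelForCausalLM.from_pretrained(
    base_model_id, quantization_config=bnb_config, device_map="auto"
)

# LoRA settings are illustrative only; they are not given in this section.
peft_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    bias="none", task_type="CAUSAL_LM",
)

# The values listed above, verbatim. Note that when both warmup_steps and
# warmup_ratio are set, transformers uses warmup_steps and ignores the ratio.
training_args = TrainingArguments(
    output_dir="./eeve-book-qlora",  # hypothetical output path
    num_train_epochs=3,
    per_device_train_batch_size=8,
    gradient_accumulation_steps=32,
    evaluation_strategy="steps",
    eval_steps=2000,
    logging_steps=25,
    optim="paged_adamw_8bit",
    learning_rate=2e-4,
    lr_scheduler_type="cosine",
    warmup_steps=10,
    warmup_ratio=0.05,
    report_to="tensorboard",
    weight_decay=0.01,
    max_steps=-1,
)

# Hypothetical dataset files with a "text" column holding formatted prompts.
dataset = load_dataset("json", data_files={"train": "book_train.jsonl",
                                           "eval": "book_eval.jsonl"})

trainer = SFTTrainer(
    model=model,
    args=training_args,
    train_dataset=dataset["train"],
    eval_dataset=dataset["eval"],
    peft_config=peft_config,
    tokenizer=tokenizer,
    dataset_text_field="text",
    max_seq_length=1024,  # max_seq_length from the list above
)
trainer.train()
```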
|
|
|
|
|
| Model | ROUGE-1 | ROUGE-2 | ROUGE-L |
|-------|---------|---------|---------|
| **Book** | | | |
| yanolja/EEVE-Korean-Instruct-2.8B-v1.0 | 0.2095 | 0.0866 | 0.1985 |
| ryanu/EEVE-10.8-BOOK-v0.1 | 0.2454 | 0.1158 | 0.2404 |
| meta-llama/llama-3-8b-instruct | 0.2137 | 0.0883 | 0.2020 |
| meta-llama/llama-3-70b-instruct | 0.2269 | 0.0925 | 0.2186 |
| **Paper** | | | |
| yanolja/EEVE-Korean-Instruct-2.8B-v1.0 | 0.1934 | 0.0829 | 0.1832 |
| meta-llama/llama-3-8b-instruct | 0.2044 | 0.0868 | 0.1895 |
| meta-llama/llama-3-70b-instruct | 0.1935 | 0.0783 | 0.1836 |
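For reference, ROUGE scores in the hyphenated form used above (`rouge-1`, `rouge-2`, `rouge-l`) match the output keys of the `rouge` PyPI package. The sketch below shows how such F-scores can be computed; the actual evaluation script, references, and tokenization behind the numbers in the table are not stated here, so treat it purely as an illustration with placeholder inputs.

```python
# Hedged sketch: computing ROUGE-1/2/L F-scores with the `rouge` package
# (pip install rouge). Inputs are hypothetical placeholders.
from rouge import Rouge

predictions = ["model-generated summary goes here"]
references = ["reference summary goes here"]

rouge = Rouge()
scores = rouge.get_scores(predictions, references, avg=True)

# avg=True returns a dict keyed by "rouge-1", "rouge-2", "rouge-l",
# each holding recall ("r"), precision ("p"), and F-score ("f").
for metric in ("rouge-1", "rouge-2", "rouge-l"):
    print(f'{metric}: {scores[metric]["f"]:.4f}')
```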
|
|
|
|
|
|