---
license: llama2
datasets:
- pkupie/mc2_corpus
- togethercomputer/RedPajama-Data-1T
language:
- en
- bo
base_model:
- meta-llama/Llama-2-7b-hf
---
A continually pre-trained model based on Llama-2-7b-hf.

We train on the **Tibetan texts** in MC^2 and the **English texts** in RedPajama, mixed at a **4:1** proportion.
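Below is a minimal sketch of how such a 4:1 mixture could be assembled with the Hugging Face `datasets` library. The split and streaming arguments are illustrative assumptions (they may need adjusting for the actual dataset layouts), and sampling by example probability only approximates a token-level proportion.

```python
from datasets import load_dataset, interleave_datasets

# Illustrative sketch: stream both corpora from the Hub.
# Split/config arguments are assumptions, not taken from the model card.
tibetan = load_dataset("pkupie/mc2_corpus", split="train", streaming=True)
english = load_dataset("togethercomputer/RedPajama-Data-1T",
                       split="train", streaming=True)

# A 4:1 Tibetan:English proportion corresponds to sampling
# probabilities of 0.8 and 0.2.
mixed = interleave_datasets([tibetan, english],
                            probabilities=[0.8, 0.2], seed=42)
```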
#### Hyper-parameters:
* lr: 3e-5
* batch size: 1M tokens (2K * 512)
* lr scheduler: cosine
* min lr: 1e-6
* lr decay iters: 10240
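
For reference, a minimal sketch of the cosine schedule implied by the values above, assuming a standard cosine anneal from the peak lr to the minimum lr over the decay window; any warmup phase is not specified in the card and is omitted here.

```python
import math

# Values taken from the hyper-parameter list above.
PEAK_LR = 3e-5
MIN_LR = 1e-6
LR_DECAY_ITERS = 10240

def lr_at(step: int) -> float:
    """Learning rate at a given step under cosine decay (no warmup assumed)."""
    if step >= LR_DECAY_ITERS:
        return MIN_LR
    progress = step / LR_DECAY_ITERS                      # 0.0 -> 1.0 over the window
    cosine = 0.5 * (1.0 + math.cos(math.pi * progress))   # 1.0 -> 0.0
    return MIN_LR + (PEAK_LR - MIN_LR) * cosine

# e.g. lr_at(0) == 3e-5 and lr_at(10240) == 1e-6
```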