shanchen
/

llama3-8B-slerp-med-262k

Text Generation

gradientai/Llama-3-8B-Instruct-262k

johnsnowlabs/JSL-MedLlama-3-8B-v1.0

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

llama3-8B-slerp-med-262k / mergekit_config.yml

shanchen's picture

Upload folder using huggingface_hub

f8ea11c verified 7 months ago

history blame contribute delete

414 Bytes


	slices:
	- sources:
	- model: gradientai/Llama-3-8B-Instruct-262k
	layer_range: [0,32]
	- model: johnsnowlabs/JSL-MedLlama-3-8B-v1.0
	layer_range: [0,32]
	merge_method: slerp
	base_model: gradientai/Llama-3-8B-Instruct-262k
	parameters:
	t:
	- filter: self_attn
	value: [0.3, 0.5, 0.5, 0.7, 1]
	- filter: mlp
	value: [1, 0.7, 0.5, 0.5, 0.3]
	- value: 0.5
	dtype: bfloat16