grimjim
/

magnum-consolidatum-v1-12b

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

magnum-consolidatum-v1-12b / README.md

grimjim's picture

Initial release

5bd4051 4 months ago

|

1.4 kB

	---
	base_model:
	- anthracite-org/magnum-v4-12b
	- anthracite-org/magnum-v2.5-12b-kto
	library_name: transformers
	pipeline_tag: text-generation
	tags:
	- mergekit
	- merge
	- chat
	license: apache-2.0
	---
	# magnum-consolidatum-v1-12b

	This repo contains a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). Since the base models are different Anthracite releases dubbed "Magnum", a logical name for this type of merge is "Magnum Consolidatum".

	Both Mistral Nemo and ChatML prompt formats are supported by this model.
	Tested within 16k token context length using temperatures 0.65-1.0 and minP 0.01, sometimes with smooth sampling enabled (factor 0.23, curve 4.32). Text generation lengths lean to the longer side.

	## Merge Details
	### Merge Method

	This model was merged using the SLERP merge method.

	### Models Merged

	The following models were included in the merge:
	* [anthracite-org/magnum-v4-12b](https://huggingface.co/anthracite-org/magnum-v4-12b)
	* [anthracite-org/magnum-v2.5-12b-kto](https://huggingface.co/anthracite-org/magnum-v2.5-12b-kto)

	### Configuration

	The following YAML configuration was used to produce this model:

	```yaml
	models:
	- model: anthracite-org/magnum-v4-12b
	- model: anthracite-org/magnum-v2.5-12b-kto
	merge_method: slerp
	base_model: anthracite-org/magnum-v4-12b
	parameters:
	t:
	- value: 0.5
	dtype: bfloat16

	```