|
--- |
|
base_model: |
|
- Pyroserenus/Orthrus-12b-v0.8 |
|
- nbeerbower/mistral-nemo-gutenberg-12B-v2 |
|
- mergekit-community/Deutscher-Pantheon-12B |
|
- IntervitensInc/Mistral-Nemo-Base-2407-chatml |
|
library_name: transformers |
|
tags: |
|
- mergekit |
|
- merge |
|
license: cc-by-nc-4.0 |
|
--- |
|
# Ultra-Instruct-12B
|
|
|
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). |
|
|
|
## Merge Details |
|
### Merge Method |
|
|
|
This model was merged with the [DARE](https://arxiv.org/abs/2311.03099)-[TIES](https://arxiv.org/abs/2306.01708) merge method, using [IntervitensInc/Mistral-Nemo-Base-2407-chatml](https://huggingface.co/IntervitensInc/Mistral-Nemo-Base-2407-chatml) as the base model. DARE randomly drops a fraction of each fine-tuned model's delta (its "task vector" relative to the base) and rescales the surviving entries, while TIES resolves sign conflicts between the deltas before they are summed back onto the base.
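For intuition, here is a toy sketch of the idea applied to a single parameter tensor. This is illustrative only and is not mergekit's actual implementation; the function name and the random tensors are invented for the example:

```python
import torch

def dare_ties_merge(base, finetuned, weights, density=0.5, seed=0):
    """Toy DARE-TIES merge of one parameter tensor.

    base:      the base model's tensor
    finetuned: list of fine-tuned tensors, same shape as base
    weights:   per-model merge weights (cf. `weight` in the YAML)
    density:   fraction of each delta kept (cf. `density` in the YAML)
    """
    torch.manual_seed(seed)
    deltas = []
    for ft, w in zip(finetuned, weights):
        delta = ft - base                                  # task vector
        mask = torch.bernoulli(torch.full_like(delta, density))
        delta = delta * mask / density                     # DARE: drop and rescale
        deltas.append(w * delta)
    # TIES sign election: keep only contributions whose sign agrees with
    # the summed (majority) sign at each parameter position.
    stacked = torch.stack(deltas)
    majority_sign = torch.sign(stacked.sum(dim=0))
    agree = torch.sign(stacked) == majority_sign
    merged_delta = (stacked * agree).sum(dim=0)
    return base + merged_delta

# Random tensors standing in for one weight matrix of the four models:
base = torch.randn(4, 4)
fts = [base + 0.1 * torch.randn(4, 4) for _ in range(3)]
merged = dare_ties_merge(base, fts, weights=[0.3, 0.3, 0.6])
```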
|
|
|
### Models Merged |
|
|
|
The following models were included in the merge: |
|
* [Pyroserenus/Orthrus-12b-v0.8](https://huggingface.co/Pyroserenus/Orthrus-12b-v0.8) |
|
* [nbeerbower/mistral-nemo-gutenberg-12B-v2](https://huggingface.co/nbeerbower/mistral-nemo-gutenberg-12B-v2) |
|
* [mergekit-community/Deutscher-Pantheon-12B](https://huggingface.co/mergekit-community/Deutscher-Pantheon-12B) |
|
|
|
### Configuration |
|
|
|
The following YAML configuration was used to produce this model: |
|
|
|
```yaml |
|
models: |
|
- model: mergekit-community/Deutscher-Pantheon-12B |
|
parameters: |
|
weight: 0.3 |
|
density: 0.5 |
|
- model: nbeerbower/mistral-nemo-gutenberg-12B-v2 |
|
parameters: |
|
weight: 0.3 |
|
density: 0.5 |
|
- model: Pyroserenus/Orthrus-12b-v0.8 |
|
parameters: |
|
weight: 0.6 |
|
density: 0.5 |
|
merge_method: dare_ties |
|
base_model: IntervitensInc/Mistral-Nemo-Base-2407-chatml |
|
dtype: bfloat16 |
|
name: Ultra-Instruct-12B |
|
``` |
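To reproduce the merge, save the YAML above as `config.yaml` and run it with mergekit's CLI (`mergekit-yaml config.yaml ./Ultra-Instruct-12B --cuda`), or drive it from Python. The sketch below assumes the Python API shown in mergekit's README; the paths are placeholders:

```python
import yaml
import torch

from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

CONFIG_YML = "./config.yaml"        # the YAML shown above
OUT_PATH = "./Ultra-Instruct-12B"   # output directory for the merged model

with open(CONFIG_YML, "r", encoding="utf-8") as fp:
    merge_config = MergeConfiguration.model_validate(yaml.safe_load(fp))

run_merge(
    merge_config,
    OUT_PATH,
    options=MergeOptions(
        cuda=torch.cuda.is_available(),
        copy_tokenizer=True,   # copy the base model's (ChatML) tokenizer
        lazy_unpickle=False,
        low_cpu_memory=False,
    ),
)
```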
|
|
|
Use with caution; the author is not responsible for what you do with this model. Warning: the model is uncensored (and smart).
|
|
|
Use the ChatML prompt format (see the sketch below). Note: the model appears to have trouble stopping, so replies can run very long. If you value extremely long replies, this might be the model for you.
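A minimal inference sketch with `transformers`, assuming the merged weights are published under a Hugging Face repo id (the id below is hypothetical; substitute your own). Since the model can have trouble stopping, it may help to explicitly include ChatML's `<|im_end|>` among the end-of-sequence tokens:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "your-username/Ultra-Instruct-12B"  # hypothetical repo id

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID, torch_dtype=torch.bfloat16, device_map="auto"
)

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Write a short story about a lighthouse."},
]
# The base model ships a ChatML chat template, so this should produce
# <|im_start|>/<|im_end|>-formatted input.
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# Pin ChatML's <|im_end|> as an additional stop token to curb run-on replies.
im_end = tokenizer.convert_tokens_to_ids("<|im_end|>")
output = model.generate(
    inputs,
    max_new_tokens=512,
    eos_token_id=[tokenizer.eos_token_id, im_end],
)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```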