---
base_model:
- rhysjones/phi-2-orange-v2
- Weyaxi/Einstein-v4-phi2
library_name: transformers
tags:
- mergekit
- merge
license: mit
---
|
## L-MChat-Small |
|
<div style="text-align:center;width:250px;height:250px;"> |
|
<img src="https://cdn.lauche.eu/L-MChat-Series-Logo.jpeg" alt="L-MChat-Series-Logo">
|
</div> |
|
This model is an experiment of mine in how well small merges perform: there are plenty of merges of 7B-and-larger models, but comparatively few at the ~2B scale.
|
|
|
### Merge Method |
|
|
|
This model was merged using the SLERP merge method. |
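
SLERP (spherical linear interpolation) blends two checkpoints along the great-circle arc between their flattened weight tensors rather than along a straight line, which preserves each tensor's magnitude better than plain averaging. Below is a minimal, illustrative sketch of the core operation; mergekit's actual implementation differs in its edge-case handling and per-tensor details.

```python
import torch

def slerp(t: float, a: torch.Tensor, b: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """Illustrative SLERP between two weight tensors (not mergekit's exact code)."""
    a_flat, b_flat = a.flatten().float(), b.flatten().float()
    # Angle between the two tensors, computed from their unit directions.
    cos_omega = torch.clamp(
        (a_flat / (a_flat.norm() + eps)) @ (b_flat / (b_flat.norm() + eps)), -1.0, 1.0
    )
    omega = torch.acos(cos_omega)
    sin_omega = torch.sin(omega)
    if sin_omega.abs() < eps:
        # Nearly parallel tensors: fall back to plain linear interpolation.
        return (1.0 - t) * a + t * b
    # Interpolate along the arc, then restore the original shape and dtype.
    out = (torch.sin((1.0 - t) * omega) * a_flat + torch.sin(t * omega) * b_flat) / sin_omega
    return out.reshape(a.shape).to(a.dtype)
```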
|
|
|
### Models Merged |
|
|
|
The following models were included in the merge: |
|
* [rhysjones/phi-2-orange-v2](https://huggingface.co/rhysjones/phi-2-orange-v2) |
|
* [Weyaxi/Einstein-v4-phi2](https://huggingface.co/Weyaxi/Einstein-v4-phi2) |
|
|
|
### Configuration |
|
|
|
The following YAML configuration was used to produce this model: |
|
|
|
```yaml
slices:
- sources:
  - model: Weyaxi/Einstein-v4-phi2
    layer_range: [0, 32]
  - model: rhysjones/phi-2-orange-v2
    layer_range: [0, 32]
merge_method: slerp
base_model: rhysjones/phi-2-orange-v2
parameters:
  t:
  - filter: self_attn
    value: [0, 0.5, 0.3, 0.7, 1]
  - filter: mlp
    value: [1, 0.5, 0.7, 0.3, 0]
  - value: 0.5
dtype: bfloat16
```
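
The `t` values define an interpolation gradient across the layer stack: per-layer weights for the self-attention and MLP sub-modules, and `0.5` for all remaining tensors. Assuming a standard mergekit installation (`pip install mergekit`), a configuration like this is typically run with `mergekit-yaml config.yml ./output-dir`; exact flags may vary between mergekit versions.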
|
|
|
## Usage |
|
|
|
Use the ChatML prompt format; the Hugging Face Inference API also works for this model. A minimal `transformers` sketch follows.
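
The snippet below loads the model and generates a reply from a hand-built ChatML prompt. The repository id is a placeholder (this card does not state the final Hub path), and `device_map="auto"` assumes `accelerate` is installed.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "<your-namespace>/L-MChat-Small"  # placeholder: substitute the actual Hub repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"  # device_map="auto" needs `accelerate`
)

# ChatML prompt, written out manually in case the tokenizer ships no chat template.
prompt = (
    "<|im_start|>system\nYou are a helpful assistant.<|im_end|>\n"
    "<|im_start|>user\nSummarize what a model merge is in one sentence.<|im_end|>\n"
    "<|im_start|>assistant\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```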