KaraKaraWitch
/

Llama-MagicalGirl

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Llama-MagicalGirl / README.md

KaraKaraWitch's picture

Update README.md

2a75a91 verified 21 days ago

|

history blame contribute delete

2.38 kB

	---
	base_model:
	- SicariusSicariiStuff/Negative_LLAMA_70B
	- TheDrummer/Nautilus-70B-v0.1
	- KaraKaraWitch/Llama-3.X-Workout-70B
	- Steelskull/L3.3-Nevoria-R1-70b
	- Tarek07/Inception-LLaMa-70B
	library_name: transformers
	tags:
	- mergekit
	- merge

	---

	# MagicalGirl

	No image for this model. A auditory replacement has been [provided.](https://files.catbox.moe/7z2w06.flac)

	<audio id="player" controls>
	<source src="https://files.catbox.moe/7z2w06.flac" type="audio/flac" />
	</audio>
	<script src="https://cdn.plyr.io/3.7.8/plyr.js"></script>

	This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

	### Sampling Settings

	I keep playing around with sampler settings more often than not due to model not being super creative or just overly verbose. Anyway, I landed on the following for this model:

	```
	Temperature: 1.4
	Min P: 0.03
	```

	This applies retroactively to [KaraKaraWitch/Llama-3.X-Workout-70B](https://huggingface.co/KaraKaraWitch/Llama-3.X-Workout-70B) as well.

	## Notes

	- Seems to fit my requirements for the most part. Not too sure how exactly others would feel but I find that this is the model I envisioned.
	- Without a system prompt, the model can get quite offensive and dark. Consider writing a simple system prompt before using.

	## Merge Details
	### Merge Method

	This model was merged using the [SCE](https://arxiv.org/abs/2408.07990) merge method using [KaraKaraWitch/Llama-3.X-Workout-70B](https://huggingface.co/KaraKaraWitch/Llama-3.X-Workout-70B) as a base.

	### Models Merged

	The following models were included in the merge:
	* [SicariusSicariiStuff/Negative_LLAMA_70B](https://huggingface.co/SicariusSicariiStuff/Negative_LLAMA_70B)
	* [TheDrummer/Nautilus-70B-v0.1](https://huggingface.co/TheDrummer/Nautilus-70B-v0.1)
	* [Steelskull/L3.3-Nevoria-R1-70b](https://huggingface.co/Steelskull/L3.3-Nevoria-R1-70b)
	* [Tarek07/Inception-LLaMa-70B](https://huggingface.co/Tarek07/Inception-LLaMa-70B)

	### Configuration

	The following YAML configuration was used to produce this model:

	```yaml
	models:
	- model: SicariusSicariiStuff/Negative_LLAMA_70B
	- model: TheDrummer/Nautilus-70B-v0.1
	- model: Tarek07/Inception-LLaMa-70B
	- model: Steelskull/L3.3-Nevoria-R1-70b

	merge_method: sce
	base_model: KaraKaraWitch/Llama-3.X-Workout-70B
	parameters:
	select_topk: 1.0
	dtype: bfloat16
	```