[Llama 3.3] Model Rock Smashing
Collection
Merges of Recent Llama 3.3 models
•
6 items
•
Updated
No image for this model. A auditory replacement has been provided.
This is a merge of pre-trained language models created using mergekit.
I keep playing around with sampler settings more often than not due to model not being super creative or just overly verbose. Anyway, I landed on the following for this model:
Temperature: 1.4
Min P: 0.03
This applies retroactively to KaraKaraWitch/Llama-3.X-Workout-70B as well.
This model was merged using the SCE merge method using KaraKaraWitch/Llama-3.X-Workout-70B as a base.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
models:
- model: SicariusSicariiStuff/Negative_LLAMA_70B
- model: TheDrummer/Nautilus-70B-v0.1
- model: Tarek07/Inception-LLaMa-70B
- model: Steelskull/L3.3-Nevoria-R1-70b
merge_method: sce
base_model: KaraKaraWitch/Llama-3.X-Workout-70B
parameters:
select_topk: 1.0
dtype: bfloat16