athirdpath/Llama-3.1-Instruct_NSFW-pretrained_e1 further pretrained on 1 epoch of the dirty stories from nothingiisreal/Reddit-Dirty-And-WritingPrompts, with all scores below 2 dropped.


Why do this? I have a niche use case where I cannot increase compute over 8b, and L3/3.1 are the only models in this size category that meet my needs for logic. However, both versions of L3/3.1 have the damn repetition/token overconfidence problem, and this is meant to disrupt that certainty without disrupting the model's ability to function.

By the way, I think it's the lm_head that is causing the looping, but it might be the embeddings being too separated. I'm not going to pay two more times to test them separately, however :p

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 20.74
IFEval (0-Shot) 45.21
BBH (3-Shot) 28.02
MATH Lvl 5 (4-Shot) 8.84
GPQA (0-shot) 5.59
MuSR (0-shot) 8.30
MMLU-PRO (5-shot) 28.50
Downloads last month
131
Safetensors
Model size
8.03B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for athirdpath/Llama-3.1-Instruct_NSFW-pretrained_e1-plus_reddit

Finetuned
(1)
this model
Merges
1 model
Quantizations
2 models

Evaluation results