Mahou-1.3-llama3-8B

Mahou is our attempt to build a production-ready conversational/roleplay LLM.

Future versions will be released iteratively and finetuned on flammen.ai conversational data.

License

This model is based on Meta Llama-3-8B and is governed by the META LLAMA 3 COMMUNITY LICENSE AGREEMENT.

Chat Format

This model has been trained to use ChatML format. Note the additional tokens in tokenizer_config.json.

<|im_start|>system
{{system}}<|im_end|>
<|im_start|>{{char}}
{{message}}<|im_end|>
<|im_start|>{{user}}
{{message}}<|im_end|>
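
As a rough illustration of the template above, the following Python sketch assembles a prompt string by hand. The function names and the speaker names ("Aria", "Sam") are hypothetical. The newline between turns and the open header left at the end for the model to complete are assumptions based on common ChatML usage, not something this card specifies. Note that the template uses the character and user names as roles, rather than fixed "assistant"/"user" labels.

```python
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def format_turn(role: str, content: str) -> str:
    """Wrap one turn in ChatML delimiters: header line, content, end token."""
    return f"{IM_START}{role}\n{content}{IM_END}\n"


def build_prompt(system: str, turns: list[tuple[str, str]], next_speaker: str) -> str:
    """Build a ChatML prompt from a system message and (speaker, message) turns.

    The trailing open header for `next_speaker` is an assumption: it cues the
    model to write that speaker's next message.
    """
    parts = [format_turn("system", system)]
    for speaker, message in turns:
        parts.append(format_turn(speaker, message))
    parts.append(f"{IM_START}{next_speaker}\n")
    return "".join(parts)


if __name__ == "__main__":
    prompt = build_prompt(
        system="You are Aria, a student at a magician academy.",
        turns=[("Sam", "hey, how was class today?")],
        next_speaker="Aria",
    )
    print(prompt)
```

When loading the model through a library that ships a chat template, prefer that template over hand-built strings; this sketch is only meant to make the layout concrete.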

Roleplay Format

Speech without quotes.
Actions in *asterisks*

*leans against wall cooly* so like, i just casted a super strong spell at magician academy today, not gonna lie, felt badass.
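
If a frontend needs to render actions and speech differently, a message in this format can be split with a small parser. This is a minimal sketch under the assumption that actions never contain a literal asterisk and are never nested; `split_roleplay` is a hypothetical helper, not part of any Mahou tooling.

```python
import re

# One action is any run of non-asterisk characters between two asterisks.
ACTION_RE = re.compile(r"\*([^*]+)\*")


def split_roleplay(message: str) -> list[tuple[str, str]]:
    """Split a message into ("action", text) and ("speech", text) segments."""
    segments = []
    pos = 0
    for match in ACTION_RE.finditer(message):
        # Anything before this action (and after the previous one) is speech.
        speech = message[pos:match.start()].strip()
        if speech:
            segments.append(("speech", speech))
        segments.append(("action", match.group(1).strip()))
        pos = match.end()
    # Trailing text after the last action is speech as well.
    tail = message[pos:].strip()
    if tail:
        segments.append(("speech", tail))
    return segments


if __name__ == "__main__":
    for kind, text in split_roleplay("*leans against wall* so like, felt badass."):
        print(f"{kind}: {text}")
```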

SillyTavern (ST) Settings

Use ChatML for the Context Template.
Enable Instruct Mode.
Use the Mahou preset.
Recommended: Add newline as a stopping string: ["\n"]
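
Outside SillyTavern, the same newline stop can be approximated by truncating the generated text after the fact. The sketch below assumes the raw completion is already available as a string; `apply_stop_strings` is a hypothetical helper and does not reproduce how any particular frontend or inference server implements stopping.

```python
def apply_stop_strings(text: str, stop_strings: list[str]) -> str:
    """Cut `text` at the earliest occurrence of any stop string."""
    cut = len(text)
    for stop in stop_strings:
        if not stop:
            # An empty stop string would match at index 0 and erase everything.
            continue
        index = text.find(stop)
        if index != -1:
            cut = min(cut, index)
    return text[:cut]


if __name__ == "__main__":
    raw = "*grins* yeah, it went great\n<|im_start|>Sam\nnice!"
    # Keeps only the first line of the reply.
    print(apply_stop_strings(raw, ["\n"]))
```

With a newline as the stop string, each reply is limited to a single line, which suits the short, chat-style messages shown in the roleplay example. If a completion begins with a newline, this sketch returns an empty string, so callers may want to strip leading whitespace first.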

Method

Finetuned for 10 epochs using an A100 on Google Colab.

Reference: "Fine-tune Llama 3 with ORPO" by Maxime Labonne.

Model size: 8.03B params (Safetensors). Tensor type: BF16.

Base model: nbeerbower/llama-3-Daredevil-Mahou-8B (this model is a finetune of it).
