---
base_model:
- ToastyPigeon/ms3-roselily-instruct
library_name: transformers
tags:
- mergekit
- merge
---
# todo
make a model card and put a cute girl on it
# some info
Making this public so it can be tried and possibly merged if desired while I work on getting the energy to write a proper card.
Short list of things to know:
- This is a bunch of creative data (RP, story writing, etc.) applied to [ToastyPigeon/ms3-roselily-instruct](https://huggingface.co/ToastyPigeon/ms3-roselily-instruct).
- Instruct format: ChatML or Alpaca preferred, Tekken v7 possible
- ChatML tokens were assigned to unused tokens 20 and 21; this leaves all the Tekken tokens intact, so merges w/ Tekken models are feasible
- Instruct-tuning phase did include Tekken v7 so the tokens are initialized and recognized, but I did not continue with it on the creative step because I do not like it for creative stuff (too restrictive with turn order)
- Feels a little less sensitive to samplers than Instruct-based MS3 models, but should probably still be used with conservative samplers
# chat templates
You may need to set `<|im_end|>` and/or `</s>` as stopping strings depending on which format you're using. The model generates both properly, but tokenizers can be finicky about what they stop on by default.
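
If your frontend has no stopping-string setting, one fallback is to trim the output yourself. This is a minimal sketch in plain Python; `trim_at_stop` is a hypothetical helper name, not part of any library, and it assumes you get the generation back as a decoded string with special tokens left in.

```python
def trim_at_stop(text, stop_strings=("<|im_end|>", "</s>")):
    """Cut generated text at the earliest stop string, if any is present.

    Hypothetical helper: the default stop strings are the two end markers
    named in this card; everything else is an illustration.
    """
    cut = len(text)
    for stop in stop_strings:
        idx = text.find(stop)
        if idx != -1:
            # Keep the earliest stop position across all stop strings.
            cut = min(cut, idx)
    return text[:cut]


raw = "Sure, here you go.<|im_end|>\n<|im_start|>user\n"
reply = trim_at_stop(raw)  # text before the first end marker
```

Trimming after the fact does not stop generation early, so it only cleans up the text; setting real stop strings in your backend is still the better option when available.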
Alpaca w/ System
```
### System:
{system prompt}
### Instruction:
{user message}
### Response:
{model answer}</s>
```
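
A rough sketch of building that Alpaca prompt by hand for a single turn. `build_alpaca_prompt` is a hypothetical helper, and ending the prompt right after `### Response:` plus a newline (so the model writes the answer and its own `</s>`) is an assumption read off the template above.

```python
def build_alpaca_prompt(system_prompt: str, user_message: str) -> str:
    """Format one system + user turn in the Alpaca layout shown above.

    Hypothetical helper. The prompt stops after the Response header so the
    model supplies the answer and the closing </s> itself.
    """
    return (
        f"### System:\n{system_prompt}\n"
        f"### Instruction:\n{user_message}\n"
        "### Response:\n"
    )


prompt = build_alpaca_prompt("You are a narrator.", "Describe the tavern.")
```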
ChatML
```
<|im_start|>system
{system prompt}<|im_end|>
<|im_start|>user
{user message}<|im_end|>
<|im_start|>assistant
{model answer}<|im_end|>
```
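
The same idea for ChatML, as a minimal sketch. `build_chatml_prompt` is a hypothetical helper; the `role`/`content` message dicts and the trailing open `assistant` turn are assumptions based on common ChatML usage, not something this card specifies.

```python
def build_chatml_prompt(messages, add_generation_prompt=True):
    """Format a list of {"role", "content"} dicts in the ChatML layout above.

    Hypothetical helper. With add_generation_prompt=True the prompt ends on
    an open assistant turn for the model to complete.
    """
    parts = [
        f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n"
        for m in messages
    ]
    if add_generation_prompt:
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


prompt = build_chatml_prompt([
    {"role": "system", "content": "You are a narrator."},
    {"role": "user", "content": "Describe the tavern."},
])
```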
The model also saw some completion training in chat mode and adventure mode.