This model has some tokenization problems on its own (tokensurgery with a shotgun was applied), but was meant to be used in a merge. use at your own risk.

image/png

Uses ChatML Formatting, Text completion preset here

image/png (Notes pulled from original card), [since the data is the same]: One off train most likely, this was done purely for internal testing purposes but seemed ok enough to release. I do not plan to offer any kind of extended support for using this model, so your mileage may vary depending on use and context size.

  • (Nemo 12B instruct as base)
  • 200k randomized subset of GU_instruct-Remastered-1.1, with a splash of 25k hathor/poppy sauce, slow cooked for 3 epochs on medium heat.
Downloads last month
15
Safetensors
Model size
12.2B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

Model tree for Nitral-Archive/Captain_BMO-12B-ChatMLified

Merges
2 models