Is it possible for us to try out the 4th model?
Hi Undi! I quite like this merge of yours, and I think it's a pretty great model that really punches above its weight in some areas. It's the only 12B model that's been able to pass some scenarios of mine, and I'm wondering if that's because of the private model trained on Claude data that you threw into the mix.
Each of the 3 models currently available failed some of my private tests in one regard or another, even at near-0 temperature, and none of them could handle 1 scenario in particular when trying to continue a scene, whether due to hallucinations, endless replies, or formatting issues. I tried a merge method similar to yours, substituting a variety of models I had on hand for the 4th model to replicate it, and none of them even came close, it seems.
I think you might be secretly holding on to a gem of a model, even if it was only intended as filler. Thank you for reading this message, and I do hope you'll consider releasing that model to the public!
You want Undi95/LocalC-12B-e2.0 ?
Undi95/LocalC-12B-e2.0
Undi95/LocalC-12B-e2.0-GGUF
Here you go. It's a Mistral-Nemo 12B model trained on Claude logs of 16k ctx.
Do whatever you want with it I guess haha
Bless you. π
You want Undi95/LocalC-12B-e2.0 ?
Yes! I've been banging my head trying to find a way to make some private merges work, and only yours was able to finish the job. I came to the conclusion this must have been the secret sauce, lol. Really appreciate it, thanks!