DavidAU's picture
Update README.md
b94fa1d verified
|
raw
history blame
1.18 kB
metadata
license: apache-2.0
language:
  - en
tags:
  - story
  - general usage
  - roleplay
  - creative
  - rp
  - fantasy
  - story telling
  - ultra high precision

NEO CLASS Ultra Quants for : L3-8B-Stheno-v3.2

Additional quants are uploading...

The NEO Class tech was created after countless investigations and over 120 lab experiments backed by real world testing and qualitative results.

NEO Class results:

Better overall function, instruction following, output quality and stronger connections to ideas, concepts and the world in general.

In addition quants now operate above their "grade" so to speak :

IE: Q4 / IQ4 operate at Q5KM/Q6 levels.

Likewise for Q3/IQ3 operate at Q4KM/Q5 levels.

The examples below illustrate the least amount of improvement using some of the lowest quants.

Perplexity drop of 1191 points for Neo Class Imatrix quant of IQ4XS VS regular quant of IQ4XS.

(lower is better)

Model Notes:

Maximum context is 8k. Please see original model maker's page for details, and usage information for this model.

Special thanks to the model creators at SAO10K for making such a fantastic model:

[ https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2 ]