eramax/Tess-XS-v1-3-yarn-128K-q8-gguf


eramax committed on Nov 27, 2023

Commit 43b2059 • 1 Parent(s): 69168cc

Update README.md

Files changed (1)

README.md +25 -0


@@ -1,3 +1,28 @@
 ---
 license: apache-2.0
 ---

+## Q8-GGUF for [migtissera/Tess-XS-v1-3-yarn-128K](https://huggingface.co/migtissera/Tess-XS-v1-3-yarn-128K)
+# Note:
+This version is the stable release. The issues that were present in versions 1.0, 1.1 and 1.2 have all been rectified. Thank you for your patience while R&D was conducted. Enjoy!
+This model has been tested on context lengths up to 16K. The model produced slight repetition around 16K context length. I recommend testing the model on your use case and limiting the context length accordingly.
+Here are my learnings going from Tess-v1.0 to Tess-v1.3: https://migel.substack.com/p/learnings-from-training-tess
+# Tess
+![Tess](https://huggingface.co/migtissera/Tess-M-v1.0/resolve/main/Tess.png)
+Tess, short for Tessoro/Tessoso, is a general-purpose Large Language Model series. Tess-XS-v1.3 was trained on the Nous Research Mistral-7B-yarn-128K base.
+# Prompt Format:
+```
+SYSTEM: <ANY SYSTEM CONTEXT>
+USER:
+ASSISTANT:
+```
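
The prompt format above can be assembled programmatically. The following is a minimal sketch, not part of the original model card: the helper name `build_prompt` is hypothetical, and it assumes that the user message goes on the same line as `USER:`, that fields are separated by single newlines, and that the prompt ends at `ASSISTANT:` so the model continues from there. The template itself does not spell out these details, so adjust if outputs look off.

```python
def build_prompt(system: str, user: str) -> str:
    """Assemble a Tess-style prompt string.

    Assumptions (not confirmed by the model card): single-newline
    separators, message text on the same line as its role label, and
    no trailing text after "ASSISTANT:".
    """
    return f"SYSTEM: {system}\nUSER: {user}\nASSISTANT:"


if __name__ == "__main__":
    prompt = build_prompt(
        "You are a concise assistant.",
        "Summarize GGUF in one sentence.",
    )
    print(prompt)
```

The resulting string can be passed as the raw prompt to whichever GGUF runtime is used to load the Q8 file. Given the note about repetition near 16K, it may be worth capping the context window in that runtime as well.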