---
base_model:
- openchat/openchat-3.5-0106
library_name: transformers
---
<p align="center">
  <a href="https://ko-fi.com/pretergeek">Buy me a Ko-Fi</a> •
  <a href="https://patreon.com/Pretergeek">Support my work using Patreon</a>
</p>

# OpenChat-3.5-0106_32K-PoSE

This model is [Openchat-3.5-0106](https://huggingface.co/openchat/openchat-3.5-0106) with its context length extended from 8,192 tokens to 32,768 tokens using [PoSE](https://huggingface.co/papers/2309.10400).

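PoSE works by training on short sequences whose position ids are skipped ahead so that, collectively, they cover the longer target window. The following is only an illustrative sketch of that position-id manipulation (the function name, chunking scheme, and skip sampling are assumptions, not the exact training code from the PoSE paper):

```python
import random

def pose_position_ids(seq_len, target_len, num_chunks=2, rng=None):
    """Illustrative PoSE-style position ids: split a short training
    sequence into chunks and shift later chunks by random skips so the
    ids span the longer target context window (e.g. 32K instead of 8K)."""
    rng = rng or random.Random(0)
    total_skip = target_len - seq_len
    # Chunk boundaries over the real (short) sequence.
    bounds = [i * seq_len // num_chunks for i in range(num_chunks + 1)]
    # Non-decreasing cumulative skips; chunk 0 stays anchored at position 0.
    offsets = [0] + sorted(rng.randint(0, total_skip) for _ in range(num_chunks - 1))
    ids = []
    for k in range(num_chunks):
        ids.extend(p + offsets[k] for p in range(bounds[k], bounds[k + 1]))
    return ids

ids = pose_position_ids(seq_len=8, target_len=32)
print(ids)  # 8 strictly increasing position ids drawn from range(32)
```

Because the model only ever sees `seq_len` real tokens per step, it learns to handle position ids up to `target_len` without paying the memory cost of genuinely long training sequences.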
The model was fine-tuned using [Rank-Stabilized LoRA](https://huggingface.co/blog/damjan-k/rslora) on the [LongAlpaca-12K](https://huggingface.co/datasets/Yukang/LongAlpaca-12k) dataset. I hope to extend the context further in future versions and then apply the same methods to my [upscaled versions of OpenChat-3.5](https://huggingface.co/collections/Pretergeek/openchat-35-0106-with-additional-layers-66a8d3262c7c3ebdd7783a29).
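Rank-Stabilized LoRA changes only the scaling of the low-rank update: classic LoRA multiplies the update by `alpha / r`, while rsLoRA uses `alpha / sqrt(r)` so the update does not shrink as the rank grows. A minimal sketch of the difference (the helper name is illustrative):

```python
import math

def lora_scaling(alpha: float, r: int, rank_stabilized: bool = False) -> float:
    """Return the scalar applied to the low-rank update B @ A.

    Classic LoRA uses alpha / r; rank-stabilized LoRA (rsLoRA) uses
    alpha / sqrt(r), which keeps update magnitudes stable at high rank.
    """
    return alpha / math.sqrt(r) if rank_stabilized else alpha / r

# At rank 64, classic scaling shrinks the update 8x more than rsLoRA does.
print(lora_scaling(16, 64))        # 0.25
print(lora_scaling(16, 64, True))  # 2.0
```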