Text Generation
Transformers
GGUF
English
mergekit
Mixture of Experts
mixture of experts
Merge
8x8B
128k context
Llama3 MOE
creative
creative writing
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
science fiction
romance
all genres
story
writing
vivid prosing
vivid writing
fiction
roleplaying
bfloat16
swearing
rp
horror
Inference Endpoints
conversational
Update README.md
Browse files
README.md
CHANGED
@@ -134,6 +134,9 @@ You can set the number of experts in LMStudio (https://lmstudio.ai) at the "load
|
|
134 |
|
135 |
For Text-Generation-Webui (https://github.com/oobabooga/text-generation-webui) you set the number of experts at the loading screen page.
|
136 |
|
|
|
|
|
|
|
137 |
For server.exe / Llama-server.exe (Llamacpp - https://github.com/ggerganov/llama.cpp/blob/master/examples/server/README.md )
|
138 |
add the following to the command line to start the "llamacpp server" (CLI):
|
139 |
|
|
|
134 |
|
135 |
For Text-Generation-Webui (https://github.com/oobabooga/text-generation-webui) you set the number of experts at the loading screen page.
|
136 |
|
137 |
+
For KolboldCPP (https://github.com/LostRuins/koboldcpp) Version 1.8+ , on the load screen, click on "TOKENS",
|
138 |
+
you can set experts on this page, and the launch the model.
|
139 |
+
|
140 |
For server.exe / Llama-server.exe (Llamacpp - https://github.com/ggerganov/llama.cpp/blob/master/examples/server/README.md )
|
141 |
add the following to the command line to start the "llamacpp server" (CLI):
|
142 |
|