bartowski committed
Commit dc510ca
1 Parent(s): cdd27da

Update README.md

Files changed (1)
  1. README.md +10 -2
README.md CHANGED
@@ -188,9 +188,17 @@ extra_gated_button_content: Submit
  quantized_by: bartowski
  ---
 
- ## Llamacpp iMatrix Quantizations of Meta-Llama-3-8B-Instruct
+ ## Llamacpp imatrix Quantizations of Meta-Llama-3-8B-Instruct
 
- <b>Warning: This conversion is based on the merged Llama 3 support in llama.cpp (release b2710), and you will need to update your inference tool to at least version 2710 of llama.cpp; this will vary across tools. Until then, you can use the older style with <|eot_id|> special = false [here](https://huggingface.co/bartowski/Meta-Llama-3-8B-Instruct-GGUF-old)</b>
+ <b>This conversion is based on the merged Llama 3 support in llama.cpp (release b2710)</b>
+
+ # Known working on:
+ - LM Studio 0.2.20
+
+ # Confirmed not working on (as of April 21):
+ - text-generation-webui master/dev
+
+ Any others unknown, feel free to comment
 
  Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> release <a href="https://github.com/ggerganov/llama.cpp/releases/tag/b2710">b2710</a> for quantization.
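
As a quick usage sketch (not part of the commit): the snippet below assumes the llama-cpp-python bindings with a bundled llama.cpp at or past the release that merged Llama 3 support; the file name, context size, and GPU offload values are placeholders, not values from this repo's documentation.

```python
# Hedged sketch, not from the commit: assumes llama-cpp-python is installed
# with a llama.cpp build that already includes the merged Llama 3 support.
# The file name, context size, and GPU offload below are placeholders.
from llama_cpp import Llama

llm = Llama(
    model_path="Meta-Llama-3-8B-Instruct-Q4_K_M.gguf",  # any quant from this repo
    n_ctx=8192,       # adjust to available memory
    n_gpu_layers=-1,  # offload all layers if built with GPU support, else 0
)

# On a recent enough build, the Llama 3 chat template stored in the GGUF
# metadata is picked up, so <|eot_id|> ends turns as intended.
out = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Say hello in one sentence."},
    ],
    max_tokens=64,
)
print(out["choices"][0]["message"]["content"])
```

Tools that bundle an older llama.cpp (see the compatibility notes in the diff above) will mishandle the end-of-turn token until they update.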