Update README.md
Browse files
README.md
CHANGED
@@ -20,7 +20,7 @@ tags:
|
|
20 |
# Model Card for Model ID
|
21 |
|
22 |
<!-- Provide a quick summary of what the model is/does. -->
|
23 |
-
This is a quantized version of `Llama 3.1 70B Instruct`. Quantization to
|
24 |
|
25 |
|
26 |
|
|
|
20 |
# Model Card for Model ID
|
21 |
|
22 |
<!-- Provide a quick summary of what the model is/does. -->
|
23 |
+
This is a quantized version of `Llama 3.1 70B Instruct`. Quantization to **4-bit** using `bistandbytes` and `accelerate`.
|
24 |
|
25 |
|
26 |
|