nintwentydo
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -39,4 +39,6 @@ Chat template in chat_template.json uses the v7 instruct template:
|
|
39 |
*- If you have a better implementation of the tokenizer let me know and I'm happy to swap it out.*
|
40 |
*- As always pls respect the model license.*
|
41 |
|
42 |
-
|
|
|
|
|
|
39 |
*- If you have a better implementation of the tokenizer let me know and I'm happy to swap it out.*
|
40 |
*- As always pls respect the model license.*
|
41 |
|
42 |
+
|
43 |
+
## Quantizations
|
44 |
+
EXL2 quants are available in different sizes [here](https://huggingface.co/models?other=exl2&author=nintwentydo&search=pixtral-large). You'll need to use dev branch of [ExLlamaV2](https://github.com/turboderp/exllamav2/tree/dev) for vision input.
|