nintwentydo committed on
Commit f0c5ac2 · verified · 1 Parent(s): aca8d98

Update README.md

Files changed (1)
  1. README.md +3 -1
README.md CHANGED
@@ -39,4 +39,6 @@ Chat template in chat_template.json uses the v7 instruct template:
  *- If you have a better implementation of the tokenizer let me know and I'm happy to swap it out.*
  *- As always pls respect the model license.*
 
- Currently doing a fresh measurement run ahead of re-doing my exl2 quants which I'll upload. Apologies in advance if anything is wonky, tbh this is just a personal learning exercise for me and I decided to make this model my fixation to freshen up on my knowledge lol.
+
+ ## Quantizations
+ EXL2 quants are available in different sizes [here](https://huggingface.co/models?other=exl2&author=nintwentydo&search=pixtral-large). You'll need to use dev branch of [ExLlamaV2](https://github.com/turboderp/exllamav2/tree/dev) for vision input.
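
The added README text says vision input needs the dev branch of ExLlamaV2. A minimal sketch of one way to get that branch installed from source is below. The function name `install_exllamav2_dev` and the checkout directory `exllamav2-dev` are hypothetical names chosen for illustration, and the sketch assumes `git`, `pip`, and a working CUDA/PyTorch setup are already present; it is not taken from the commit itself.

```shell
# Repository and branch named in the README change above.
EXL2_REPO="https://github.com/turboderp/exllamav2"
EXL2_BRANCH="dev"

# Hypothetical helper: clone only the dev branch and install it from source.
# Runs in a subshell so the caller's working directory is left unchanged.
install_exllamav2_dev() {
  (
    set -e
    git clone --branch "$EXL2_BRANCH" --single-branch "$EXL2_REPO" exllamav2-dev
    cd exllamav2-dev
    pip install -r requirements.txt
    pip install .
  )
}
```

Calling `install_exllamav2_dev` inside the Python environment you plan to use for inference performs the clone and install. Nothing runs until the function is called, so the block can be sourced safely first.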