bnjmnmarie commited on
Commit
37a224e
·
verified ·
1 Parent(s): b751fd4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +21 -1
README.md CHANGED
@@ -1,4 +1,24 @@
1
  ---
 
 
2
  library_name: transformers
 
 
 
3
  license: llama3.3
4
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - en
4
  library_name: transformers
5
+ tags:
6
+ - auto-gptq
7
+ - AutoRound
8
  license: llama3.3
9
+ ---
10
+
11
+
12
+ ## Model Details
13
+
14
+ This is [meta-llama/Llama-3.3-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct) quantized with [AutoRound](https://github.com/intel/auto-round/tree/main) (symmetric quantization) and serialized with the GPTQ format in 4-bit. The model has been created, tested, and evaluated by The Kaitchup.
15
+
16
+ Details on the quantization process and how to use the model here:
17
+ [How to Quantize and Run Llama 3.3 70B Instruct on Your GPU](https://kaitchup.substack.com/p/how-to-quantize-and-run-llama-33)
18
+
19
+ ![Llama 3.3 70B Instruct_ Zero-Shot MMLU (Accuracy).png](https://cdn-uploads.huggingface.co/production/uploads/64b93e6bd6c468ac7536607e/0UOK_IsUinziw4GUMqq3_.png)
20
+ ![Llama 3.3 70B Instruct_ Model Size.png](https://cdn-uploads.huggingface.co/production/uploads/64b93e6bd6c468ac7536607e/hB8AdtD1DfVC90kOkYowz.png)
21
+
22
+ - **Developed by:** [The Kaitchup](https://kaitchup.substack.com/)
23
+ - **Language(s) (NLP):** English
24
+ - **License:** Llama 3.3