OpenSourceRonin committed on
Commit 22749ce
1 Parent(s): 57dd1f1

Create README.md

Files changed (1)
  1. README.md +31 -0
README.md ADDED
@@ -0,0 +1,31 @@
+ ---
+ license: llama3.1
+ ---
+
+ **Disclaimer**:
+
+ This model is reproduced from the paper *VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models* ([GitHub](https://github.com/microsoft/vptq), [arXiv](https://arxiv.org/abs/2409.17066)).
+
+ The model itself is sourced from a community release.
+
+ It is intended only for experimental purposes.
+
+ Users are responsible for any consequences arising from the use of this model.
+
+ **Note**:
+
+ The PPL test results are for reference only and were collected using the GPTQ testing script.
+
+ ```json
+ {
+   "ctx_2048": {
+     "wikitext2": 6.8713483810424805
+   },
+   "ctx_4096": {
+     "wikitext2": 6.460692882537842
+   },
+   "ctx_8192": {
+     "wikitext2": 6.278013229370117
+   }
+ }
+ ```
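
For readers unfamiliar with the PPL figures in the README, the following is a minimal sketch of how a perplexity value is typically derived from per-token negative log-likelihoods over fixed-length context windows. It is not the GPTQ testing script and not part of this commit; the function name `perplexity`, its parameters, and the toy inputs are all hypothetical and chosen only for illustration.

```python
import math


def perplexity(token_nlls, ctx_len):
    """Perplexity over non-overlapping windows of ctx_len tokens.

    token_nlls: per-token negative log-likelihoods in nats (hypothetical input;
                a real evaluation would obtain these from the model).
    ctx_len:    window length, e.g. 2048, 4096 or 8192.
    Trailing tokens that do not fill a whole window are dropped.
    """
    n_windows = len(token_nlls) // ctx_len
    if n_windows == 0:
        raise ValueError("need at least ctx_len tokens")
    used = token_nlls[: n_windows * ctx_len]
    # Perplexity is the exponential of the mean negative log-likelihood.
    return math.exp(sum(used) / len(used))


# Toy values only; these are not measurements from the model.
toy_nlls = [math.log(4.0)] * 8
print(perplexity(toy_nlls, ctx_len=4))
```

In a real evaluation the per-token losses themselves depend on the window length, since longer windows give each token more preceding context. That is consistent with the lower WikiText-2 values reported at larger `ctx` settings above.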