Update README.md
README.md
CHANGED
@@ -22,11 +22,13 @@ This repository includes the quantized models for the [ReMM V2.2](https://huggin
 
 ## Current models
 
-| exl2 Quant | Model Branch | Model Size | Minimum VRAM (4096 Context) | BPW |
+| exl2 Quant | Model Branch | Model Size | Minimum recommended VRAM GPU (4096 Context) | BPW |
 |-|-|-|-|-|
-| 3-Bit | main | N/A |
-
-
+| 3-Bit | main | N/A | 10GB GPU | 3.14 |
+| 3-Bit | 3bit | 6.36 GB | 10GB GPU | 3.72 |
+| 4-Bit | 4bit | 7.13 GB | 12GB GPU (10GB with swap) | 4.2 |
+| 4-Bit | 4.6bit | 7.81 GB | 12GB GPU | 4.63 |
+| 5-Bit | [Orang Baik's Repo](https://huggingface.co/R136a1/ReMM-v2.2-L2-13B-exl2) | 8.96 GB | 16GB GPU (12GB with swap) | 5.33 |
 
 ### Note
 