Update README.md
Browse files
README.md
CHANGED
@@ -18,9 +18,10 @@ learning_rate = 1.5e-5
|
|
18 |
```
|
19 |
2. Due to nature of BPE (tiktoken), tokenizer expansion/resize is not very friendly to training. Use text based special tokens if you need/use extra tokens to avoid bad train/eval losses
|
20 |
|
21 |
-
|
22 |
|
23 |
-
1.
|
|
|
24 |
---
|
25 |
inference: false
|
26 |
license: other
|
|
|
18 |
```
|
19 |
2. Due to nature of BPE (tiktoken), tokenizer expansion/resize is not very friendly to training. Use text based special tokens if you need/use extra tokens to avoid bad train/eval losses
|
20 |
|
21 |
+
Quants:
|
22 |
|
23 |
+
1. 4bit gptq/marlin: https://huggingface.co/LnL-AI/dbrx-base-converted-v2-4bit-gptq-marlin
|
24 |
+
2. 4bit gptq/gptq: https://huggingface.co/LnL-AI/dbrx-base-converted-v2-4bit-gptq-gptq
|
25 |
---
|
26 |
inference: false
|
27 |
license: other
|