1 contributor

History: 56 commits

datatab

q5_0: Higher accuracy, higher resource usage and slower inference.

a08886e verified 8 months ago

.gitattributes

4 kB

q5_0: Higher accuracy, higher resource usage and slower inference. 8 months ago
README.md

3.42 kB

Update README.md 9 months ago
YugoGPT-Quantized-GGUF.Q3_K_XS.gguf

3 GB
LFS

q3_k_xs" : "3-bit extra small quantization 9 months ago
YugoGPT-Quantized-GGUF.Q5_K_M.gguf

5.13 GB
LFS

Rename YugoGPT-Quantized-GGUF-unsloth.Q5_K_M.gguf to YugoGPT-Quantized-GGUF.Q5_K_M.gguf 9 months ago
YugoGPT-Quantized-GGUF.Q5_K_S.gguf

5 GB
LFS

Rename YugoGPT-Quantized-GGUF-unsloth.Q5_K_S.gguf to YugoGPT-Quantized-GGUF.Q5_K_S.gguf 9 months ago
YugoGPT-Quantized-GGUF.Q6_K.gguf

5.94 GB
LFS

Rename YugoGPT-Quantized-GGUF-unsloth.Q6_K.gguf to YugoGPT-Quantized-GGUF.Q6_K.gguf 9 months ago
YugoGPT-Quantized-GGUF.Q8_0.gguf

7.7 GB
LFS

Rename YugoGPT-Quantized-GGUF-unsloth.Q8_0.gguf to YugoGPT-Quantized-GGUF.Q8_0.gguf 9 months ago
YugoGPT-Quantized.GGUF.Q2_K.gguf

2.72 GB
LFS

q2_k: Uses Q4_K for the attention.vw and feed_forward.w2 tensors, Q2_K for the other tensors. 8 months ago
YugoGPT-Quantized.GGUF.Q3_K_L.gguf

3.82 GB
LFS

q3_k_l: Uses Q5_K for the attention.wv, attention.wo, and feed_forward.w2 tensors, else Q3_K 8 months ago
YugoGPT-Quantized.GGUF.Q3_K_M.gguf

3.52 GB
LFS

q3_k_m: Uses Q4_K for the attention.wv, attention.wo, and feed_forward.w2 tensors, else Q3_K 8 months ago
YugoGPT-Quantized.GGUF.Q3_K_S.gguf

3.16 GB
LFS

q3_k_s: Uses Q3_K for all tensors 8 months ago
YugoGPT-Quantized.GGUF.Q4_0.gguf

4.11 GB
LFS

q4_0: Original quant method, 4-bit. 8 months ago
YugoGPT-Quantized.GGUF.Q4_K_M.gguf

4.37 GB
LFS

q4_k_m: Recommended. Uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q4_K 8 months ago
YugoGPT-Quantized.GGUF.Q4_K_S.gguf

4.14 GB
LFS

q4_k_s: Uses Q4_K for all tensors 8 months ago
YugoGPT-Quantized.GGUF.Q5_0.gguf

5 GB
LFS

q5_0: Higher accuracy, higher resource usage and slower inference. 8 months ago
config.json

31 Bytes

Create config.json 9 months ago