---
tags:
- quantization
- shuvom/yuj-v1
license: apache-2.0
---

# yuj-v1-GGUF

- Model creator: [shuvom_](https://huggingface.co/shuvom)
- Original model: [shuvom/yuj-v1](https://huggingface.co/shuvom/yuj-v1)
<!-- description start -->
## Description

This repo contains GGUF format model files for [shuvom/yuj-v1](https://huggingface.co/shuvom/yuj-v1).

<!-- description end -->

<!-- README_GGUF.md-about-gguf start -->
### About GGUF

GGUF and its predecessor GGML are file formats for storing model weights for inference, used especially with large language models such as GPT-style (Generative Pre-trained Transformer) models. For background, see [this overview](https://medium.com/@phillipgimmi/what-is-gguf-and-ggml-e364834d241c).
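By convention, GGUF filenames embed the quantization method between the model name and the extension, as in the Provided files table below. A minimal sketch of extracting it, assuming the common `<model>.<QUANT>.gguf` naming pattern (the helper name is hypothetical, not part of any GGUF tooling):

```python
from pathlib import Path

def quant_method(filename: str) -> str:
    # Hypothetical helper: assumes the common "<model>.<QUANT>.gguf" pattern,
    # e.g. "yuj-v1.Q4_K_M.gguf" -> "Q4_K_M".
    return Path(filename).stem.rsplit(".", 1)[-1]

print(quant_method("yuj-v1.Q4_K_M.gguf"))  # Q4_K_M
```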

## Provided files

| Name | Quant method | Bits | Size | Max RAM required | Use case |
| ---- | ---- | ---- | ---- | ---- | ----- |
| [yuj-v1.Q4_K_M.gguf](https://huggingface.co/shuvom/yuj-v1-GGUF/blob/main/yuj-v1.Q4_K_M.gguf) | Q4_K_M | 4 | 4.37 GB | 6.87 GB | medium, balanced quality - recommended |
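The "Max RAM required" figure appears to follow the convention used in similar GGUF model cards: file size plus roughly 2.5 GB of runtime overhead, with no layers offloaded to a GPU. A minimal sketch of that estimate (the function and the overhead constant are assumptions, not from llama.cpp documentation):

```python
def max_ram_gb(file_size_gb: float, overhead_gb: float = 2.5) -> float:
    # Assumption: RAM use is roughly file size + a fixed runtime overhead,
    # valid only when no layers are offloaded to a GPU.
    return round(file_size_gb + overhead_gb, 2)

print(max_ram_gb(4.37))  # 6.87, matching the table row for Q4_K_M
```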