GGUF format files of the model vinai/PhoGPT-4B-Chat.

I'm trying to get PhoGPT to work with llama-cpp and llama-cpp-python.

I cannot get [nguyenviet/PhoGPT-4B-Chat-GGUF](https://huggingface.co/nguyenviet/PhoGPT-4B-Chat-GGUF) to work in Colab:

```
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="nguyenviet/PhoGPT-4B-Chat-GGUF",
    filename="*q3_k_m.gguf*",
)

...
llama_model_load: error loading model: done_getting_tensors: wrong number of tensors; expected 388, got 387
llama_load_model_from_file: failed to load model
...
```

My [issue](https://github.com/VinAIResearch/PhoGPT/issues/22) was resolved (thanks to @nviet and @datquocnguyen), and I figure people will want to try the model in Colab, so I created my own `GGUF` file.
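Once the GGUF file loads, PhoGPT-4B-Chat still expects its instruction-style prompt format rather than raw text. A minimal helper, assuming the template documented in the upstream vinai/PhoGPT README (`### Câu hỏi: ... ### Trả lời:`):

```python
# Prompt template as documented in the upstream vinai/PhoGPT README.
PROMPT_TEMPLATE = "### Câu hỏi: {instruction}\n### Trả lời:"

def build_prompt(instruction: str) -> str:
    """Wrap a user question in the instruction format PhoGPT-4B-Chat was tuned on."""
    return PROMPT_TEMPLATE.format(instruction=instruction)

prompt = build_prompt("Viết bài thơ về Hà Nội.")
# Pass `prompt` as the first argument of the llama-cpp-python `Llama` call,
# e.g. llm(prompt, max_tokens=256).
```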