Spaces:

expandme-tech
/

SmallZOO-GGUFee-Llama

Running

expandme commited on Dec 4, 2024

Commit

95feeee

1 Parent(s): bba907b

Testing of GGUF Llama3.2 3B

Files changed (3) hide show

README.md CHANGED Viewed

@@ -5,7 +5,7 @@ colorFrom: green
 colorTo: indigo
 sdk: gradio
 app_file: app.py
-pinned: true
 license: cc-by-sa-4.0
 short_description: SmallZOO runnigng SLMs directly on CPU with Llama.cpp&Python
 ---

 colorTo: indigo
 sdk: gradio
 app_file: app.py
+pinned: flase
 license: cc-by-sa-4.0
 short_description: SmallZOO runnigng SLMs directly on CPU with Llama.cpp&Python
 ---

app.py CHANGED Viewed

@@ -2,10 +2,9 @@ import gradio as gr
 from llama_cpp import Llama
 import requests
 llm = Llama.from_pretrained(
-    repo_id="cognitivecomputations/dolphin-2.9.2-qwen2-7b-gguf",
-    filename="*Q4_K_S.gguf",
     verbose=True,
     n_ctx=32768,
     n_threads=2,

 from llama_cpp import Llama
 import requests
 llm = Llama.from_pretrained(
+    repo_id="lmstudio-community/Llama-3.2-3B-Instruct-GGUF",
+    filename="*Q4_K_M.gguf",
     verbose=True,
     n_ctx=32768,
     n_threads=2,

models.lst ADDED Viewed


1	+ Stack of modesl to try:
2	+
3	+ https://huggingface.co/lmstudio-community/Llama-3.2-3B-Instruct-GGUF