Update app.py
app.py CHANGED

@@ -12,16 +12,9 @@ model_path = hf_hub_download(
 
 llm = Llama(
     model_path=model_path,
-
-    #n_threads=8,
+    n_ctx=2048,
     chat_format="llama-3",
-    #
-    #f16_kv=True,
-    #logits_all=False,
-    #use_mmap=True,
-    #use_gpu=True,
-    #n_gpu_layers=-1, # to ensure all layers are on GPU
-    #offload_kqv=True # for better memory management
+    n_gpu_layers=-1, # ensure all layers are on GPU
 )
 
 # Placeholder responses for when context is empty
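For context, a minimal sketch of how the updated constructor might be exercised with llama-cpp-python. The repo_id and filename passed to hf_hub_download are hypothetical placeholders (the real arguments are truncated in the hunk header above), and the create_chat_completion call is only an illustrative usage example, not part of this commit.

# Minimal sketch, assuming llama-cpp-python and huggingface_hub are installed.
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Placeholder repo/filename; the actual values are not shown in this diff.
model_path = hf_hub_download(
    repo_id="your-org/your-gguf-repo",
    filename="model.Q4_K_M.gguf",
)

llm = Llama(
    model_path=model_path,
    n_ctx=2048,            # context window added in this commit
    chat_format="llama-3",
    n_gpu_layers=-1,       # ensure all layers are on GPU
)

# create_chat_completion uses an OpenAI-style message schema.
response = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Hello!"}],
    max_tokens=128,
)
print(response["choices"][0]["message"]["content"])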