Spaces:

Kvikontent
/

Llama-3-70B

Runtime error

Kvikontent commited on May 21, 2024

Commit

ff147c7

verified ·

1 Parent(s): fdfe19c

Create app.py

Files changed (1) hide show

app.py ADDED Viewed

+import gradio as gr
+from huggingface_hub import InferenceClient
+import spaces
+client = InferenceClient("meta-llama/Meta-Llama-3-70B-Instruct")
+messages=[]
+client.chat_completion(messages, max_tokens=1024)
+@spaces.GPU()
+def respond(prompt):
+    response = client.chat_completion(
+        model="meta-llama/Meta-Llama-3-70B-Instruct",
+        messages=messages,
+        max_tokens=500,
+    )
+    return response.content
+gr.ChatInterface(respond).launch()