Change LLaMA2-70B to Sheared 1.3B
Browse files
app.py
CHANGED
@@ -34,7 +34,7 @@ torch_device = "cuda" if torch.cuda.is_available() else "cpu"
|
|
34 |
print("Running on device:", torch_device)
|
35 |
print("CPU threads:", torch.get_num_threads())
|
36 |
|
37 |
-
model_id = "meta-llama/Llama-2-70b-chat-hf"  [extraction truncated the removed line after the opening quote; model id reconstructed from the commit title "Change LLaMA2-70B to Sheared 1.3B" — verify against the original app.py]
|
38 |
biencoder = SentenceTransformer("intfloat/e5-large-v2", device=torch_device)
|
39 |
cross_encoder = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-12-v2", max_length=512, device=torch_device)
|
40 |
|
|
|
34 |
print("Running on device:", torch_device)
|
35 |
print("CPU threads:", torch.get_num_threads())
|
36 |
|
37 |
+
model_id = "princeton-nlp/Sheared-LLaMA-1.3B"
|
38 |
biencoder = SentenceTransformer("intfloat/e5-large-v2", device=torch_device)
|
39 |
cross_encoder = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-12-v2", max_length=512, device=torch_device)
|
40 |
|