huu-ontocord commited on
Commit
dbf69b2
·
verified ·
1 Parent(s): 25e6c2f

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +30 -0
README.md ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ ---
4
+
5
+ ## Model Summary
6
+
7
+ The Phi3-18.5b is a depth upsampled version of the 14b [Phi-3-medium-128k-instruct](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct). We removed the bottom 8 layers of one copy of the 14b and the top 8 layers of another copy of the 14b model and stacked them. We plan to do continued pretraining to improve performance.
8
+ Since this model has not been continued pretrained, the quality may vary.
9
+ ```
10
+ !pip intsall transformers accelerate
11
+ from transformers import AutoTokenizer, AutoModelForCausalLM
12
+ import torch
13
+ tokenizer = AutoTokenizer.from_pretrained("ontocord/phi3-18.5b", trust_remote_code=True)
14
+ model = AutoModelForCausalLM.from_pretrained("ontocord/phi3-18.5b",
15
+ torch_dtype="auto", device_map="auto", trust_remote_code=True, )
16
+ with torch.no_grad():
17
+ print(tokenizer.batch_decode(model.generate(**tokenizer("<|user|>\nHow to explain Internet for a medieval knight?<|end|>\n<|assistant|>\n", return_tensors="pt").to('cuda'), max_new_tokens=128), use_cache=True)[0])
18
+
19
+ ```
20
+
21
+ Will produce:
22
+ ```
23
+ <|user|> How to explain Internet for a medieval knight?<|end|><|assistant|> Ah, noble knight, let me attempt to explain this mystical realm known as the Internet in terms that might resonate with your medieval understanding.
24
+
25
+ Imagine, if you will, a vast kingdom stretching beyond the horizon, where countless villages, towns, and cities are connected by a network of roads, bridges, and pathways. This kingdom is not bound by physical borders, but instead, it exists in a realm beyond our own, accessible only through magical devices known as computers, tablets, and smartph€™s.
26
+
27
+ In this kingdom, information flows like a mighty river,...
28
+ ```
29
+
30
+ See the [Phi-3-medium-128k-instruct](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct) model card for more details.