raincandy-u commited on
Commit
ec7676f
·
verified ·
1 Parent(s): bf714b1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +46 -3
README.md CHANGED
@@ -1,3 +1,46 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+
5
+ # raincandy-u/TinyChat-1776K
6
+
7
+ A tiny LM trained on TinyChat dataset from scratch.
8
+
9
+ The aim is to try to achieve natural responses on the smallest possible model. Trained using a dataset of 3 year old children level English conversations.
10
+
11
+ Note: It has no world knowledge, so you should not ask it any intellectual questions.
12
+
13
+ ## Model Spec
14
+
15
+ ```
16
+ config = AutoConfig.for_model(
17
+ model_type="llama",
18
+ hidden_size=192,
19
+ intermediate_size=640,
20
+ num_attention_heads=16,
21
+ num_hidden_layers=3,
22
+ num_key_value_heads=4,
23
+ tie_word_embeddings=True,
24
+ vocab_size=2048,
25
+ max_position_embeddings=256
26
+ )
27
+ ```
28
+
29
+ ## Template
30
+
31
+ ```
32
+ <A>Hi, Tom. How are you? <end>
33
+ <B>I'm fine, thank you. And you? <end>
34
+ <A>Fine. What's your favorite color? <end>
35
+ <B>My favorite color is black. <end>
36
+ <A>Do you like cats? <end>
37
+ <B>
38
+ ```
39
+
40
+ ## Generation Param
41
+
42
+ ```
43
+ top_k=40,
44
+ top_p=0.8,
45
+ temperature=1
46
+ ```