renillhuang
committed on
Update README.md
README.md
@@ -69,7 +69,7 @@ tags:
 |Embedding Tying | False |
 |Position embedding | RoPE |
 |seq_len | 8192 |
-|Vocabulary Size |
+|Vocabulary Size | 113664 |
 
 - Training hyper-parameters
 - We use the AdamW optimizer with hyperparameters set to 𝛽1 = 0.9, 𝛽2 = 0.95, and a weight decay of 0.1.
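
For readers unfamiliar with the optimizer settings quoted in the context lines, the following is a minimal pure-Python sketch of a single AdamW update using 𝛽1 = 0.9, 𝛽2 = 0.95, and a weight decay of 0.1. The function name `adamw_step`, the learning rate `lr`, and the `eps` term are assumptions made for illustration; the README excerpt does not state them, and this is not the project's training code.

```python
import math


def adamw_step(params, grads, m, v, step,
               lr=1e-3,            # assumed: the README excerpt gives no learning rate
               beta1=0.9,          # from the README
               beta2=0.95,         # from the README
               weight_decay=0.1,   # from the README
               eps=1e-8):          # assumed: common default, not stated in the README
    """Apply one AdamW update in place to a list of float parameters.

    `m` and `v` hold the running first and second moment estimates and
    `step` is the 1-based update count used for bias correction.
    """
    for i, g in enumerate(grads):
        # Decoupled weight decay: shrink the weight directly instead of
        # adding a penalty term to the gradient.
        params[i] -= lr * weight_decay * params[i]

        # Exponential moving averages of the gradient and its square.
        m[i] = beta1 * m[i] + (1 - beta1) * g
        v[i] = beta2 * v[i] + (1 - beta2) * g * g

        # Bias correction for the zero-initialised moments.
        m_hat = m[i] / (1 - beta1 ** step)
        v_hat = v[i] / (1 - beta2 ** step)

        params[i] -= lr * m_hat / (math.sqrt(v_hat) + eps)
    return params
```

Calling `adamw_step([1.0], [0.5], [0.0], [0.0], step=1)` updates the single weight once; with a zero gradient only the weight-decay shrinkage applies.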