Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ocisd4
/
mistral_tokenizer_ext
like
0
Follow
ocisd4
28
Model card
Files
Files and versions
Community
samleeasus
commited on
Jan 16, 2024
Commit
5ee61bd
·
verified
·
1 Parent(s):
4f9b914
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+1
-1
README.md
CHANGED
Viewed
@@ -1,5 +1,5 @@
1
2
-
Mistral擴充詞表只包含教育部常用
8000
字
3
4
後面補了25個dummy token,補到64的倍數可以增加訓練效率
5
未來可以作為special token的預留空間
1
2
+
Mistral擴充詞表只包含教育部常用
4808
字
3
4
後面補了25個dummy token,補到64的倍數可以增加訓練效率
5
未來可以作為special token的預留空間