mshojaei77 commited on
Commit
4a2317f
·
verified ·
1 Parent(s): 2aa89c1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -18,6 +18,8 @@ tags:
18
  ### Model Description
19
  The `PersianBPETokenizer` is a custom tokenizer specifically designed for the Persian (Farsi) language. It leverages the Byte-Pair Encoding (BPE) algorithm to create a robust vocabulary that can effectively handle the unique characteristics of Persian text. This tokenizer is optimized for use with advanced language models like BERT and RoBERTa, making it a valuable tool for various Persian NLP tasks.
20
 
 
 
21
  ### Model Type
22
  - **Tokenization Algorithm**: Byte-Pair Encoding (BPE)
23
  - **Normalization**: NFD, StripAccents, Lowercase, Strip, Replace (ZWNJ)
 
18
  ### Model Description
19
  The `PersianBPETokenizer` is a custom tokenizer specifically designed for the Persian (Farsi) language. It leverages the Byte-Pair Encoding (BPE) algorithm to create a robust vocabulary that can effectively handle the unique characteristics of Persian text. This tokenizer is optimized for use with advanced language models like BERT and RoBERTa, making it a valuable tool for various Persian NLP tasks.
20
 
21
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6556b1bb85d43542fa1a8f91/lZJKqsi4BZ8mJiY_I-vhA.png)
22
+
23
  ### Model Type
24
  - **Tokenization Algorithm**: Byte-Pair Encoding (BPE)
25
  - **Normalization**: NFD, StripAccents, Lowercase, Strip, Replace (ZWNJ)