EhimeNLP commited on
Commit
f225980
·
1 Parent(s): 1383211

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -0
README.md ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Model description
2
+
3
+ We pretrained a RoBERTa-based Japanese masked language model on paper abstracts from the academic database CiNii Articles.
4
+ [A Japanese Masked Language Model for Academic Domain](https://aclanthology.org/2022.sdp-1.16/)
5
+
6
+ # Vocabulary
7
+ The vocabulary consists of 32000 tokens including subwords induced by the unigram language model of sentencepiece.
8
+
9
+ ---
10
+ license: apache-2.0
11
+ language:ja
12
+ ---