Update README.md
Browse files
README.md
CHANGED
@@ -6,6 +6,45 @@ language:
|
|
6 |
library_name: transformers
|
7 |
---
|
8 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
9 |
|
10 |
## Examples
|
11 |
```python
|
|
|
6 |
library_name: transformers
|
7 |
---
|
8 |
|
9 |
+
Roberta base model trained on Azerbaijani subset of OSCAR corpus.
|
10 |
+
```json
|
11 |
+
attention_probs_dropout_prob:0.1
|
12 |
+
bos_token_id:0
|
13 |
+
classifier_dropout:null
|
14 |
+
eos_token_id:2
|
15 |
+
gradient_checkpointing:false
|
16 |
+
hidden_act:"gelu"
|
17 |
+
hidden_dropout_prob:0.1
|
18 |
+
hidden_size:768
|
19 |
+
initializer_range:0.02
|
20 |
+
intermediate_size:3072
|
21 |
+
layer_norm_eps:1e-12
|
22 |
+
max_position_embeddings:514
|
23 |
+
model_type:"roberta"
|
24 |
+
num_attention_heads:12
|
25 |
+
num_hidden_layers:6
|
26 |
+
pad_token_id:1
|
27 |
+
position_embedding_type:"absolute"
|
28 |
+
torch_dtype:"float32"
|
29 |
+
transformers_version:"4.10.0"
|
30 |
+
type_vocab_size:1
|
31 |
+
use_cache:true
|
32 |
+
vocab_size:52000
|
33 |
+
```
|
34 |
+
|
35 |
+
## Usage
|
36 |
+
```python
|
37 |
+
from transformers import AutoTokenizer, AutoModelWithLMHead
|
38 |
+
|
39 |
+
tokenizer = AutoTokenizer.from_pretrained("iamdenay/roberta-azerbaijani")
|
40 |
+
|
41 |
+
model = AutoModelWithLMHead.from_pretrained("iamdenay/roberta-azerbaijani")
|
42 |
+
```
|
43 |
+
```python
|
44 |
+
from transformers import pipeline
|
45 |
+
model_mask = pipeline('fill-mask', model='iamdenay/roberta-azerbaijani')
|
46 |
+
model_mask("Le tweet <mask>.")
|
47 |
+
```
|
48 |
|
49 |
## Examples
|
50 |
```python
|