Commit 43489fc (verified), committed by izaitova
1 Parent(s): c25e2d9

End of training

Files changed (4)
  1. README.md +86 -0
  2. config.json +1 -1
  3. model.safetensors +1 -1
  4. training_args.bin +2 -2
README.md ADDED
@@ -0,0 +1,86 @@
+ ---
+ base_model: ai-forever/ruBert-large
+ tags:
+ - generated_from_trainer
+ datasets:
+ - universal_dependencies
+ metrics:
+ - precision
+ - recall
+ - f1
+ - accuracy
+ model-index:
+ - name: ruBert-large-upos
+   results:
+   - task:
+       name: Token Classification
+       type: token-classification
+     dataset:
+       name: universal_dependencies
+       type: universal_dependencies
+       config: ru_syntagrus
+       split: validation
+       args: ru_syntagrus
+     metrics:
+     - name: Precision
+       type: precision
+       value: 0.7868756986512067
+     - name: Recall
+       type: recall
+       value: 0.7638984758119328
+     - name: F1
+       type: f1
+       value: 0.772014566268761
+     - name: Accuracy
+       type: accuracy
+       value: 0.8826334008229377
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # ruBert-large-upos
+
+ This model is a fine-tuned version of [ai-forever/ruBert-large](https://huggingface.co/ai-forever/ruBert-large) on the universal_dependencies dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.8204
+ - Precision: 0.7869
+ - Recall: 0.7639
+ - F1: 0.7720
+ - Accuracy: 0.8826
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 5e-05
+ - train_batch_size: 32
+ - eval_batch_size: 8
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 20
+
+ ### Training results
+
+
+
+ ### Framework versions
+
+ - Transformers 4.39.3
+ - Pytorch 1.11.0a0+17540c5
+ - Datasets 2.21.0
+ - Tokenizers 0.15.2
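
Note on the hyperparameters in the added README.md: `learning_rate: 5e-05` with `lr_scheduler_type: linear` suggests a learning rate that decays linearly toward zero over the 20 epochs. The sketch below illustrates that shape in plain Python. It assumes zero warmup steps (the card lists none) and uses a hypothetical `steps_per_epoch`, since the card does not report the number of optimizer steps; `linear_lr` is an illustrative helper, not a function from the commit.

```python
def linear_lr(step: int, total_steps: int,
              base_lr: float = 5e-05, warmup_steps: int = 0) -> float:
    """Learning rate at `step` for a linear schedule with optional warmup."""
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    if step < warmup_steps:
        # Linear ramp from 0 up to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Linear decay from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Assumption: 1000 optimizer steps per epoch (hypothetical, not from the card).
steps_per_epoch = 1000
num_epochs = 20  # from the model card
total_steps = steps_per_epoch * num_epochs

for epoch in (0, 10, 20):
    print(epoch, linear_lr(epoch * steps_per_epoch, total_steps))
```

Under these assumptions the rate starts at the configured 5e-05, is halved at the midpoint of training, and reaches zero at the final step.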
config.json CHANGED
@@ -64,7 +64,7 @@
  "pooler_type": "first_token_transform",
  "position_embedding_type": "absolute",
  "torch_dtype": "float32",
- "transformers_version": "4.42.4",
+ "transformers_version": "4.39.3",
  "type_vocab_size": 2,
  "use_cache": true,
  "vocab_size": 120138
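
Note on the config.json change: the only edit is `transformers_version`, which moves from 4.42.4 to 4.39.3, the same Transformers version listed under "Framework versions" in the README.md. A minimal sketch of reading that field and comparing versions numerically follows. The JSON string is a trimmed stand-in containing only keys visible in the hunk, and `version_tuple` is a hypothetical helper that handles plain `major.minor.patch` strings only.

```python
import json


def version_tuple(version: str) -> tuple:
    """Convert a plain 'major.minor.patch' string into a tuple of ints."""
    return tuple(int(part) for part in version.split(".")[:3])


# Trimmed stand-in for config.json: only keys shown in the diff hunk.
config_text = """
{
  "torch_dtype": "float32",
  "transformers_version": "4.39.3",
  "type_vocab_size": 2,
  "use_cache": true,
  "vocab_size": 120138
}
"""

saved_with = json.loads(config_text)["transformers_version"]
previous = "4.42.4"  # value removed by this commit

# True when the new checkpoint was written by an older library version.
print(version_tuple(saved_with) < version_tuple(previous))
```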
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:c49656c4967dbf410bdd14dc09d04c41528b97b922960fc1929767b9bb9d391e
+ oid sha256:0900adac6efcb8595ffe8b5026352fc9433b58d31ee333ca42e1dd28ad0771bf
  size 1703557144
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:4de3e820935fd16c854a942addec8e292768902422e540f09b25e8031221e855
- size 5176
+ oid sha256:3d4045894c22897b8c50f4fed02dba6e82b88bd5c681a35783e8aa0ff42641b7
+ size 4591
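
Note on the two binary files: the diffs for model.safetensors and training_args.bin show Git LFS pointer files (a `version` line, a `sha256` object id, and a byte `size`), not the binaries themselves. The sketch below parses such a pointer and checks a local file against it. `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names introduced for illustration; they are not part of Git LFS or of this repository.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the pointer's size and hash."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in 1 MiB chunks so large weight files need not fit in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# New pointer for training_args.bin, copied from the diff above.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:3d4045894c22897b8c50f4fed02dba6e82b88bd5c681a35783e8aa0ff42641b7\n"
    "size 4591\n"
)
print(pointer["algo"], pointer["size"])  # prints "sha256 4591"
```

After downloading a file, a call such as `matches_pointer("training_args.bin", pointer)` would confirm that the local copy corresponds to this commit's pointer.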