osanseviero commited on
Commit
07bde64
1 Parent(s): da62d3d

Update spaCy pipeline

Browse files
LICENSES_SOURCES CHANGED
@@ -549,7 +549,7 @@ terms of this License.```
549
 
550
 
551
 
552
- # UD Romanian RRT v2.5
553
 
554
  * Author: Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin
555
  * URL: https://github.com/UniversalDependencies/UD_Romanian-RRT
 
549
 
550
 
551
 
552
+ # UD Romanian RRT v2.8
553
 
554
  * Author: Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin
555
  * URL: https://github.com/UniversalDependencies/UD_Romanian-RRT
README.md CHANGED
@@ -4,7 +4,7 @@ tags:
4
  - token-classification
5
  language:
6
  - ro
7
- license: CC-BY-SA-4.0
8
  model-index:
9
  - name: ro_core_news_sm
10
  results:
@@ -14,47 +14,47 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.7244897959
18
  - name: NER Recall
19
  type: recall
20
- value: 0.7091817134
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.7167540283
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
  - name: POS Accuracy
29
  type: accuracy
30
- value: 0.9612171026
31
  - task:
32
  name: SENTER
33
  type: token-classification
34
  metrics:
35
  - name: SENTER Precision
36
  type: precision
37
- value: 0.950928382
38
  - name: SENTER Recall
39
  type: recall
40
- value: 0.9534574468
41
  - name: SENTER F Score
42
  type: f_score
43
- value: 0.9521912351
44
  - task:
45
  name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
  - name: Unlabeled Dependencies Accuracy
49
  type: accuracy
50
- value: 0.8705660969
51
  - task:
52
  name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
  - name: Labeled Dependencies Accuracy
56
  type: accuracy
57
- value: 0.8705660969
58
  ---
59
  ### Details: https://spacy.io/models/ro#ro_core_news_sm
60
 
@@ -63,12 +63,12 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `ro_core_news_sm` |
66
- | **Version** | `3.1.0` |
67
- | **spaCy** | `>=3.1.0,<3.2.0` |
68
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
70
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
71
- | **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.5](https://github.com/UniversalDependencies/UD_Romanian-RRT) (Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin)<br />[RONEC - the Romanian Named Entity Corpus (ca9ce460)](https://github.com/dumitrescustefan/ronec) (Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan) |
72
  | **License** | `CC BY-SA 4.0` |
73
  | **Author** | [Explosion](https://explosion.ai) |
74
 
@@ -76,12 +76,12 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
76
 
77
  <details>
78
 
79
- <summary>View label scheme (534 labels for 4 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
- | **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc`, `Mc-p-d`, `Mc-p-l`, `Mcfp-l`, `Mcfp-ln`, `Mcfprln`, `Mcfprly`, `Mcfsoln`, `Mcfsrln`, `Mcmp-l`, `Mcms-ln`, `Mcmsrl`, `Mcmsrly`, `Mffprln`, `Mffsrln`, `Mlfpo`, `Mlfpr`, `Mlmpr`, `Mo---l`, `Mo---ln`, `Mo-s-r`, `Mofp-ln`, `Mofpoly`, `Mofprly`, `Mofs-l`, `Mofsoln`, `Mofsoly`, `Mofsrln`, `Mofsrly`, `Mompoly`, `Momprly`, `Moms-l`, `Moms-ln`, `Momsoly`, `Momsrly`, `Nc`, `Nc---n`, `Ncf--n`, `Ncfp-n`, `Ncfpoy`, `Ncfpry`, `Ncfs-n`, `Ncfson`, `Ncfsoy`, `Ncfsrn`, `Ncfsry`, `Ncfsryy`, `Ncfsvy`, `Ncm--n`, `Ncmp-n`, `Ncmpoy`, `Ncmpry`, `Ncms-n`, `Ncms-ny`, `Ncms-y`, `Ncmsoy`, `Ncmsrn`, `Ncmsry`, `Ncmsryy`, `Ncmsvn`, `Ncmsvy`, `Np`, `Npfson`, `Npfsoy`, `Npfsrn`, `Npfsry`, `Npmpoy`, `Npmpry`, `Npms-n`, `Npmsoy`, `Npmsry`, `PERCENT`, `PERIOD`, `PLUS`, `PLUSMINUS`, `Pd3-po`, `Pd3fpr`, `Pd3fso`, `Pd3fsr`, `Pd3mpo`, `Pd3mpr`, `Pd3mpr--y`, `Pd3mso`, `Pd3msr`, `Pi3`, `Pi3--r`, `Pi3-po`, `Pi3-so`, `Pi3-sr`, `Pi3fpr`, `Pi3fso`, `Pi3fsr`, `Pi3mpr`, `Pi3mso`, `Pi3msr`, `Pi3msr--y`, `Pp1-pa--------w`, `Pp1-pa--y-----w`, `Pp1-pd--------s`, `Pp1-pd--------w`, `Pp1-pd--y-----w`, `Pp1-pr--------s`, `Pp1-sa--------s`, `Pp1-sa--------w`, `Pp1-sa--y-----w`, `Pp1-sd--------s`, `Pp1-sd--------w`, `Pp1-sd--y-----w`, `Pp1-sn--------s`, `Pp2-----------s`, `Pp2-pa--------w`, `Pp2-pa--y-----w`, `Pp2-pd--------w`, `Pp2-pd--y-----w`, `Pp2-pr--------s`, `Pp2-sa--------s`, `Pp2-sa--------w`, `Pp2-sa--y-----w`, `Pp2-sd--------s`, `Pp2-sd--------w`, `Pp2-sd--y-----w`, `Pp2-sn--------s`, `Pp2-so--------s`, `Pp2-sr--------s`, `Pp3-p---------s`, `Pp3-pd--------w`, `Pp3-pd--y-----w`, `Pp3-po--------s`, `Pp3-sd--------w`, `Pp3-sd--y-----w`, `Pp3fpa--------w`, `Pp3fpa--y-----w`, `Pp3fpr--------s`, `Pp3fs---------s`, `Pp3fsa--------w`, `Pp3fsa--y-----w`, `Pp3fso--------s`, `Pp3fsr--------s`, `Pp3fsr--y-----s`, `Pp3mpa--------w`, `Pp3mpa--y-----w`, `Pp3mpr--------s`, `Pp3ms---------s`, `Pp3msa--------w`, `Pp3msa--y-----w`, `Pp3mso--------s`, `Pp3msr--------s`, `Pp3msr--y-----s`, `Ps1fp-s`, `Ps1fsrp`, `Ps1fsrs`, `Ps1mp-p`, `Ps1ms-p`, `Ps2fp-s`, `Ps2fsrp`, `Ps2fsrs`, `Ps2ms-s`, `Ps3---p`, `Ps3---s`, `Ps3fp-s`, `Ps3fsrs`, `Ps3mp-s`, `Ps3ms-s`, `Pw3--r`, `Pw3-po`, `Pw3-so`, `Pw3fpr`, `Pw3fso`, `Pw3mpr`, `Pw3mso`, `Px3--a--------s`, `Px3--a--------w`, `Px3--a--y-----w`, `Px3--d--------w`, `Px3--d--y-----w`, `Pz3-sr`, `Pz3fsr`, `QUEST`, `QUOT`, `Qf`, `Qn`, `Qs`, `Qs-y`, `Qz`, `Qz-y`, `RCURL`, `RPAR`, `RSQR`, `Rc`, `Rgc`, `Rgp`, `Rgpy`, `Rgs`, `Rp`, `Rw`, `Rw-y`, `Rz`, `SCOLON`, `SLASH`, `STAR`, `Sp`, `Spsa`, `Spsay`, `Spsd`, `Spsg`, `Td-po`, `Tdfpr`, `Tdfso`, `Tdfsr`, `Tdmpr`, `Tdmso`, `Tdmsr`, `Tf-so`, `Tffpoy`, `Tffpry`, `Tffs-y`, `Tfmpoy`, `Tfms-y`, `Tfmsoy`, `Tfmsry`, `Ti-po`, `Tifp-y`, `Tifso`, `Tifsr`, `Timso`, `Timsr`, `Tsfp`, `Tsfs`, `Tsmp`, `Tsms`, `UNDERSC`, `Va--1`, `Va--1-----y`, `Va--1p`, `Va--1s`, `Va--1s----y`, `Va--2p`, `Va--2p----y`, `Va--2s`, `Va--2s----y`, `Va--3`, `Va--3-----y`, `Va--3p`, `Va--3p----y`, `Va--3s`, `Va--3s----y`, `Vag`, `Vaii1`, `Vaii2s`, `Vaii3p`, `Vaii3s`, `Vail3p`, `Vail3s`, `Vaip1p`, `Vaip1s`, `Vaip2p`, `Vaip2s`, `Vaip3p`, `Vaip3p----y`, `Vaip3s`, `Vaip3s----y`, `Vais3p`, `Vais3s`, `Vam-2s`, `Vanp`, `Vap--sm`, `Vasp1p`, `Vasp1s`, `Vasp2p`, `Vasp2s`, `Vasp3`, `Vmg`, `Vmg-------y`, `Vmii1`, `Vmii1-----y`, `Vmii2p`, `Vmii2s`, `Vmii3p`, `Vmii3p----y`, `Vmii3s`, `Vmii3s----y`, `Vmil1`, `Vmil1p`, `Vmil2s`, `Vmil3p`, `Vmil3p----y`, `Vmil3s`, `Vmil3s----y`, `Vmip1p`, `Vmip1p----y`, `Vmip1s`, `Vmip1s----y`, `Vmip2p`, `Vmip2s`, `Vmip2s----y`, `Vmip3`, `Vmip3-----y`, `Vmip3p`, `Vmip3s`, `Vmip3s----y`, `Vmis1p`, `Vmis1s`, `Vmis3p`, `Vmis3p----y`, `Vmis3s`, `Vmis3s----y`, `Vmm-2p`, `Vmm-2s`, `Vmnp`, `Vmnp------y`, `Vmp--pf`, `Vmp--pm`, `Vmp--sf`, `Vmp--sm`, `Vmp--sm---y`, `Vmsp1p`, `Vmsp1s`, `Vmsp2s`, `Vmsp3`, `Vmsp3-----y`, `X`, `Y`, `Ya`, `Yn`, `Ynfsoy`, `Ynfsry`, `Ynmsoy`, `Ynmsry`, `Yp`, `Yp-sr`, `Yr` |
84
- | **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:agent`, `nmod:pmod`, `nmod:tmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `orphan`, `parataxis`, `punct`, `vocative`, `xcomp` |
85
  | **`senter`** | `I`, `S` |
86
  | **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
87
 
@@ -92,15 +92,21 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
92
  | Type | Score |
93
  | --- | --- |
94
  | `TOKEN_ACC` | 99.90 |
95
- | `TAG_ACC` | 96.12 |
96
- | `POS_ACC` | 95.56 |
97
- | `MORPH_ACC` | 96.24 |
98
- | `LEMMA_ACC` | 81.87 |
99
- | `DEP_UAS` | 87.06 |
100
- | `DEP_LAS` | 81.89 |
101
- | `SENTS_P` | 95.09 |
102
- | `SENTS_R` | 95.35 |
103
- | `SENTS_F` | 95.22 |
104
- | `ENTS_P` | 72.45 |
105
- | `ENTS_R` | 70.92 |
106
- | `ENTS_F` | 71.68 |
 
 
 
 
 
 
 
4
  - token-classification
5
  language:
6
  - ro
7
+ license: cc-by-sa-4.0
8
  model-index:
9
  - name: ro_core_news_sm
10
  results:
 
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.7193937038
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.711102574
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.7152241113
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
  - name: POS Accuracy
29
  type: accuracy
30
+ value: 0.9540563521
31
  - task:
32
  name: SENTER
33
  type: token-classification
34
  metrics:
35
  - name: SENTER Precision
36
  type: precision
37
+ value: 0.9679572764
38
  - name: SENTER Recall
39
  type: recall
40
+ value: 0.9640957447
41
  - name: SENTER F Score
42
  type: f_score
43
+ value: 0.9660226516
44
  - task:
45
  name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
  - name: Unlabeled Dependencies Accuracy
49
  type: accuracy
50
+ value: 0.8691229961
51
  - task:
52
  name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
  - name: Labeled Dependencies Accuracy
56
  type: accuracy
57
+ value: 0.8691229961
58
  ---
59
  ### Details: https://spacy.io/models/ro#ro_core_news_sm
60
 
 
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `ro_core_news_sm` |
66
+ | **Version** | `3.2.0` |
67
+ | **spaCy** | `>=3.2.0,<3.3.0` |
68
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
70
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
71
+ | **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.8](https://github.com/UniversalDependencies/UD_Romanian-RRT) (Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin)<br />[RONEC - the Romanian Named Entity Corpus (ca9ce460)](https://github.com/dumitrescustefan/ronec) (Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan) |
72
  | **License** | `CC BY-SA 4.0` |
73
  | **Author** | [Explosion](https://explosion.ai) |
74
 
 
76
 
77
  <details>
78
 
79
+ <summary>View label scheme (541 labels for 4 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
+ | **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afp-srn`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3---sy`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc-p-d`, `Mc-p-l`, `Mc-s-b`, `Mc-s-d`, `Mc-s-l`, `Mcfp-l`, `Mcfp-ln`, `Mcfprln`, `Mcfprly`, `Mcfsoln`, `Mcfsrl`, `Mcfsrln`, `Mcfsrly`, `Mcmp-l`, `Mcms-ln`, `Mcmsrl`, `Mcmsrln`, `Mcmsrly`, `Mffprln`, `Mffsrln`, `Mlfpo`, `Mlfpr`, `Mlmpr`, `Mo---l`, `Mo---ln`, `Mo-s-r`, `Mofp-ln`, `Mofpoly`, `Mofprly`, `Mofs-l`, `Mofsoln`, `Mofsoly`, `Mofsrln`, `Mofsrly`, `Mompoly`, `Momprly`, `Moms-l`, `Moms-ln`, `Momsoly`, `Momsrly`, `Nc`, `Nc---n`, `Ncf--n`, `Ncfp-n`, `Ncfpoy`, `Ncfpry`, `Ncfs-n`, `Ncfson`, `Ncfsoy`, `Ncfsrn`, `Ncfsry`, `Ncfsryy`, `Ncfsvy`, `Ncm--n`, `Ncmp-n`, `Ncmpoy`, `Ncmpry`, `Ncms-n`, `Ncms-ny`, `Ncms-y`, `Ncmsoy`, `Ncmsrn`, `Ncmsry`, `Ncmsryy`, `Ncmsvn`, `Ncmsvy`, `Np`, `Npfson`, `Npfsoy`, `Npfsrn`, `Npfsry`, `Npmpoy`, `Npmpry`, `Npms-n`, `Npmsoy`, `Npmsry`, `PERCENT`, `PERIOD`, `PLUS`, `PLUSMINUS`, `Pd3-po`, `Pd3fpr`, `Pd3fso`, `Pd3fsr`, `Pd3mpo`, `Pd3mpr`, `Pd3mpr--y`, `Pd3mso`, `Pd3msr`, `Pi3--r`, `Pi3-po`, `Pi3-so`, `Pi3-sr`, `Pi3fpr`, `Pi3fso`, `Pi3fsr`, `Pi3mpr`, `Pi3mso`, `Pi3msr`, `Pi3msr--y`, `Pp1-pa--------w`, `Pp1-pa--y-----w`, `Pp1-pd--------s`, `Pp1-pd--------w`, `Pp1-pd--y-----w`, `Pp1-pr--------s`, `Pp1-sa--------s`, `Pp1-sa--------w`, `Pp1-sa--y-----w`, `Pp1-sd--------s`, `Pp1-sd--------w`, `Pp1-sd--y-----w`, `Pp1-sn--------s`, `Pp2-----------s`, `Pp2-pa--------w`, `Pp2-pa--y-----w`, `Pp2-pd--------w`, `Pp2-pd--y-----w`, `Pp2-pr--------s`, `Pp2-sa--------s`, `Pp2-sa--------w`, `Pp2-sa--y-----w`, `Pp2-sd--------s`, `Pp2-sd--------w`, `Pp2-sd--y-----w`, `Pp2-sn--------s`, `Pp2-so--------s`, `Pp2-sr--------s`, `Pp3-p---------s`, `Pp3-pd--------w`, `Pp3-pd--y-----w`, `Pp3-po--------s`, `Pp3-sd--------w`, `Pp3-sd--y-----w`, `Pp3-so--------s`, `Pp3fpa--------w`, `Pp3fpa--y-----w`, `Pp3fpr--------s`, `Pp3fs---------s`, `Pp3fsa--------w`, `Pp3fsa--y-----w`, `Pp3fso--------s`, `Pp3fsr--------s`, `Pp3fsr--y-----s`, `Pp3mpa--------w`, `Pp3mpa--y-----w`, `Pp3mpr--------s`, `Pp3ms---------s`, `Pp3msa--------w`, `Pp3msa--y-----w`, `Pp3mso--------s`, `Pp3msr--------s`, `Pp3msr--y-----s`, `Ps1fp-s`, `Ps1fsrp`, `Ps1fsrs`, `Ps1mp-p`, `Ps1ms-p`, `Ps2fp-s`, `Ps2fsrp`, `Ps2fsrs`, `Ps3---p`, `Ps3---s`, `Ps3fp-s`, `Ps3fsrs`, `Ps3mp-s`, `Ps3ms-s`, `Pw3--r`, `Pw3-po`, `Pw3-so`, `Pw3fpr`, `Pw3fso`, `Pw3mpr`, `Pw3mso`, `Px3--a--------s`, `Px3--a--------w`, `Px3--a--y-----w`, `Px3--d--------w`, `Px3--d--y-----w`, `Pz3-sr`, `Pz3fsr`, `QUEST`, `QUOT`, `Qf`, `Qn`, `Qs`, `Qs-y`, `Qz`, `Qz-y`, `RCURL`, `RPAR`, `RSQR`, `Rc`, `Rgp`, `Rgpy`, `Rgs`, `Rp`, `Rw`, `Rw-y`, `Rz`, `SCOLON`, `SLASH`, `STAR`, `Sp`, `Spsa`, `Spsay`, `Spsd`, `Spsg`, `Td-po`, `Tdfpr`, `Tdfso`, `Tdfsr`, `Tdmpr`, `Tdmso`, `Tdmsr`, `Tf-so`, `Tffpoy`, `Tffpry`, `Tffs-y`, `Tfmpoy`, `Tfms-y`, `Tfmsoy`, `Tfmsry`, `Ti-po`, `Tifp-y`, `Tifso`, `Tifsr`, `Timso`, `Timsr`, `Tsfp`, `Tsfs`, `Tsmp`, `Tsms`, `UNDERSC`, `Va--1`, `Va--1-----y`, `Va--1p`, `Va--1s`, `Va--1s----y`, `Va--2p`, `Va--2p----y`, `Va--2s`, `Va--2s----y`, `Va--3`, `Va--3-----y`, `Va--3p`, `Va--3p----y`, `Va--3s`, `Va--3s----y`, `Vag`, `Vag-------y`, `Vaii1`, `Vaii2s`, `Vaii3p`, `Vaii3s`, `Vail3p`, `Vail3s`, `Vaip1p`, `Vaip1s`, `Vaip2p`, `Vaip2s`, `Vaip3p`, `Vaip3p----y`, `Vaip3s`, `Vaip3s----y`, `Vais3p`, `Vais3s`, `Vam-2s`, `Vanp`, `Vap--sm`, `Vasp1p`, `Vasp1s`, `Vasp2p`, `Vasp2s`, `Vasp3`, `Vmg`, `Vmg-------y`, `Vmii1`, `Vmii1-----y`, `Vmii2p`, `Vmii2s`, `Vmii3p`, `Vmii3p----y`, `Vmii3s`, `Vmii3s----y`, `Vmil1`, `Vmil1p`, `Vmil2s`, `Vmil3p`, `Vmil3p----y`, `Vmil3s`, `Vmil3s----y`, `Vmip1p`, `Vmip1p----y`, `Vmip1s`, `Vmip1s----y`, `Vmip2p`, `Vmip2s`, `Vmip2s----y`, `Vmip3`, `Vmip3-----y`, `Vmip3p`, `Vmip3s`, `Vmip3s----y`, `Vmis1p`, `Vmis1s`, `Vmis3p`, `Vmis3p----y`, `Vmis3s`, `Vmis3s----y`, `Vmm-2p`, `Vmm-2s`, `Vmnp`, `Vmnp------y`, `Vmp--pf`, `Vmp--pm`, `Vmp--sf`, `Vmp--sm`, `Vmp--sm---y`, `Vmsp1p`, `Vmsp2p`, `Vmsp2s`, `Vmsp3`, `Vmsp3-----y`, `X`, `Y`, `Ya`, `Yn`, `Ynfsoy`, `Ynfsry`, `Ynmsoy`, `Ynmsry`, `Yp`, `Yp,Yn`, `Yp-sr`, `Yr` |
84
+ | **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:tmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `obl:agent`, `obl:pmod`, `orphan`, `parataxis`, `punct`, `vocative`, `xcomp` |
85
  | **`senter`** | `I`, `S` |
86
  | **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
87
 
 
92
  | Type | Score |
93
  | --- | --- |
94
  | `TOKEN_ACC` | 99.90 |
95
+ | `TOKEN_P` | 99.67 |
96
+ | `TOKEN_R` | 99.57 |
97
+ | `TOKEN_F` | 99.59 |
98
+ | `TAG_ACC` | 95.41 |
99
+ | `SENTS_P` | 96.80 |
100
+ | `SENTS_R` | 96.41 |
101
+ | `SENTS_F` | 96.60 |
102
+ | `DEP_UAS` | 86.91 |
103
+ | `DEP_LAS` | 81.14 |
104
+ | `POS_ACC` | 92.98 |
105
+ | `MORPH_ACC` | 93.87 |
106
+ | `MORPH_MICRO_P` | 98.33 |
107
+ | `MORPH_MICRO_R` | 95.07 |
108
+ | `MORPH_MICRO_F` | 96.37 |
109
+ | `LEMMA_ACC` | 81.83 |
110
+ | `ENTS_P` | 71.94 |
111
+ | `ENTS_R` | 71.11 |
112
+ | `ENTS_F` | 71.52 |
accuracy.json CHANGED
@@ -1,297 +1,194 @@
1
  {
2
  "token_acc": 0.9990029326,
3
- "tag_acc": 0.9612171026,
4
- "pos_acc": 0.9556105925,
5
- "morph_acc": 0.9623560794,
6
- "lemma_acc": 0.8186589263,
7
- "dep_uas": 0.8705660969,
8
- "dep_las": 0.8189173932,
9
- "sents_p": 0.950928382,
10
- "sents_r": 0.9534574468,
11
- "sents_f": 0.9521912351,
12
- "speed": 9573.3292142848,
13
- "morph_per_feat": {
14
- "Case": {
15
- "p": 0.985836417,
16
- "r": 0.9825766247,
17
- "f": 0.9842038217
18
- },
19
- "Gender": {
20
- "p": 0.9825625474,
21
- "r": 0.9721215152,
22
- "f": 0.9773141457
23
- },
24
- "Number": {
25
- "p": 0.9856352415,
26
- "r": 0.9757716212,
27
- "f": 0.98067863
28
- },
29
- "Person": {
30
- "p": 0.9789227166,
31
- "r": 0.9852681202,
32
- "f": 0.9820851689
33
- },
34
- "PronType": {
35
- "p": 0.9951523546,
36
- "r": 0.99308915,
37
- "f": 0.9941196818
38
- },
39
- "Polarity": {
40
- "p": 0.9918166939,
41
- "r": 0.993442623,
42
- "f": 0.9926289926
43
- },
44
- "AdpType": {
45
- "p": 0.9976311337,
46
- "r": 0.9976311337,
47
- "f": 0.9976311337
48
- },
49
- "Definite": {
50
- "p": 0.9854908915,
51
- "r": 0.9703174603,
52
- "f": 0.9778453171
53
- },
54
- "Degree": {
55
- "p": 0.9436038514,
56
- "r": 0.9214237743,
57
- "f": 0.9323819232
58
- },
59
- "VerbForm": {
60
- "p": 0.9632545932,
61
- "r": 0.9760638298,
62
- "f": 0.9696169089
63
- },
64
- "Abbr": {
65
- "p": 0.9804878049,
66
- "r": 0.8973214286,
67
- "f": 0.9370629371
68
- },
69
- "Poss": {
70
- "p": 1.0,
71
- "r": 0.9951807229,
72
- "f": 0.9975845411
73
- },
74
- "NumForm": {
75
- "p": 0.9944134078,
76
- "r": 0.9807162534,
77
- "f": 0.987517337
78
- },
79
- "NumType": {
80
- "p": 0.995821727,
81
- "r": 0.9821428571,
82
- "f": 0.9889349931
83
- },
84
- "Reflex": {
85
- "p": 1.0,
86
- "r": 0.9935897436,
87
- "f": 0.9967845659
88
- },
89
- "Strength": {
90
- "p": 0.9841269841,
91
- "r": 0.9841269841,
92
- "f": 0.9841269841
93
- },
94
- "Mood": {
95
- "p": 0.9446428571,
96
- "r": 0.9706422018,
97
- "f": 0.9574660633
98
- },
99
- "Tense": {
100
- "p": 0.9470672389,
101
- "r": 0.9622093023,
102
- "f": 0.9545782264
103
- },
104
- "Variant": {
105
- "p": 0.9867549669,
106
- "r": 0.9612903226,
107
- "f": 0.9738562092
108
- },
109
- "Position": {
110
- "p": 1.0,
111
- "r": 0.9910714286,
112
- "f": 0.9955156951
113
- },
114
- "Number[psor]": {
115
- "p": 1.0,
116
- "r": 0.9666666667,
117
- "f": 0.9830508475
118
- },
119
- "PartType": {
120
- "p": 1.0,
121
- "r": 0.9459459459,
122
- "f": 0.9722222222
123
- },
124
- "Foreign": {
125
- "p": 0.0,
126
- "r": 0.0,
127
- "f": 0.0
128
- }
129
- },
130
  "dep_las_per_type": {
131
  "root": {
132
- "p": 0.8519855596,
133
- "r": 0.8888888889,
134
- "f": 0.8700460829
135
  },
136
  "mark": {
137
- "p": 0.9138576779,
138
- "r": 0.920754717,
139
- "f": 0.9172932331
140
  },
141
  "case": {
142
- "p": 0.9520392749,
143
- "r": 0.9452568429,
144
- "f": 0.948635936
145
  },
146
  "nmod:tmod": {
147
- "p": 0.5151515152,
148
- "r": 0.1428571429,
149
- "f": 0.2236842105
150
  },
151
  "amod": {
152
- "p": 0.8744625967,
153
- "r": 0.8662691652,
154
- "f": 0.8703465982
155
  },
156
  "nsubj": {
157
- "p": 0.8336106489,
158
  "r": 0.7914691943,
159
- "f": 0.811993517
160
  },
161
  "nmod": {
162
- "p": 0.8005068791,
163
- "r": 0.7893609425,
164
- "f": 0.7948948409
165
  },
166
  "aux": {
167
- "p": 0.9757462687,
168
- "r": 0.9561243144,
169
- "f": 0.9658356417
170
  },
171
  "advcl": {
172
- "p": 0.5620915033,
173
- "r": 0.6466165414,
174
- "f": 0.6013986014
175
  },
176
  "obj": {
177
- "p": 0.7931769723,
178
- "r": 0.8591224018,
179
- "f": 0.8248337029
180
  },
181
  "det": {
182
- "p": 0.9598623853,
183
- "r": 0.9479048698,
184
- "f": 0.9538461538
185
  },
186
  "cc": {
187
- "p": 0.9182389937,
188
- "r": 0.9144050104,
189
- "f": 0.9163179916
190
  },
191
  "conj": {
192
- "p": 0.5696969697,
193
- "r": 0.5446118192,
194
- "f": 0.5568720379
195
  },
196
  "nummod": {
197
- "p": 0.8888888889,
198
- "r": 0.8571428571,
199
- "f": 0.8727272727
200
  },
201
  "acl": {
202
- "p": 0.7819767442,
203
- "r": 0.6950904393,
204
- "f": 0.7359781122
205
  },
206
  "advmod": {
207
- "p": 0.7928388747,
208
- "r": 0.8179419525,
209
- "f": 0.8051948052
210
  },
211
  "obl": {
212
- "p": 0.6563500534,
213
- "r": 0.7804568528,
214
- "f": 0.7130434783
215
  },
216
  "expl:pass": {
217
- "p": 0.8372093023,
218
- "r": 0.6666666667,
219
- "f": 0.7422680412
220
  },
221
  "nsubj:pass": {
222
- "p": 0.8582089552,
223
- "r": 0.7012195122,
224
- "f": 0.7718120805
225
  },
226
  "fixed": {
227
- "p": 0.8381742739,
228
- "r": 0.8541226216,
229
- "f": 0.8460732984
230
  },
231
  "appos": {
232
- "p": 0.4291666667,
233
  "r": 0.393129771,
234
- "f": 0.4103585657
235
  },
236
  "parataxis": {
237
- "p": 0.2045454545,
238
- "r": 0.2571428571,
239
- "f": 0.2278481013
240
  },
241
  "aux:pass": {
242
- "p": 0.9256756757,
243
  "r": 0.9133333333,
244
- "f": 0.9194630872
245
  },
246
  "nmod:agent": {
247
- "p": 0.9462365591,
248
- "r": 0.7927927928,
249
- "f": 0.862745098
250
  },
251
  "ccomp": {
252
- "p": 0.8770491803,
253
- "r": 0.8294573643,
254
- "f": 0.8525896414
255
  },
256
  "nmod:pmod": {
257
- "p": 0.3555555556,
258
- "r": 0.2666666667,
259
- "f": 0.3047619048
260
  },
261
  "iobj": {
262
- "p": 0.7619047619,
263
- "r": 0.7901234568,
264
- "f": 0.7757575758
265
  },
266
  "flat": {
267
- "p": 0.7560321716,
268
- "r": 0.7421052632,
269
- "f": 0.7490039841
270
  },
271
  "cop": {
272
- "p": 0.7952755906,
273
- "r": 0.814516129,
274
- "f": 0.8047808765
275
  },
276
  "csubj": {
277
- "p": 0.7619047619,
278
- "r": 0.7619047619,
279
- "f": 0.7619047619
280
  },
281
  "dep": {
282
  "p": 0.0,
283
  "r": 0.0,
284
  "f": 0.0
285
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
286
  "expl:pv": {
287
- "p": 0.7532467532,
288
- "r": 0.8405797101,
289
- "f": 0.7945205479
290
  },
291
  "expl": {
292
- "p": 0.7037037037,
293
- "r": 0.7037037037,
294
- "f": 0.7037037037
295
  },
296
  "vocative": {
297
  "p": 0.0,
@@ -299,139 +196,258 @@
299
  "f": 0.0
300
  },
301
  "expl:poss": {
302
- "p": 0.9310344828,
303
- "r": 0.8709677419,
304
- "f": 0.9
305
  },
306
  "goeswith": {
307
- "p": 0.1,
308
- "r": 0.3333333333,
309
- "f": 0.1538461538
310
- },
311
- "xcomp": {
312
- "p": 0.325,
313
- "r": 0.4814814815,
314
- "f": 0.3880597015
315
- },
316
- "compound": {
317
- "p": 0.3333333333,
318
- "r": 0.5714285714,
319
- "f": 0.4210526316
320
- },
321
- "ccomp:pmod": {
322
  "p": 0.0,
323
  "r": 0.0,
324
  "f": 0.0
325
  },
 
 
 
 
 
326
  "orphan": {
327
  "p": 0.0,
328
  "r": 0.0,
329
  "f": 0.0
330
  },
331
  "expl:impers": {
332
- "p": 0.5,
333
- "r": 0.3333333333,
334
- "f": 0.4
 
 
 
 
 
335
  },
336
  "list": {
337
  "p": 0.0,
338
  "r": 0.0,
339
  "f": 0.0
340
  },
 
 
 
 
 
341
  "cc:preconj": {
342
  "p": 0.0,
343
  "r": 0.0,
344
  "f": 0.0
 
 
 
 
 
 
 
 
 
 
 
 
345
  },
346
- "csubj:pass": {
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
347
  "p": 0.0,
348
  "r": 0.0,
349
  "f": 0.0
350
  }
351
  },
352
- "ents_p": 0.7244897959,
353
- "ents_r": 0.7091817134,
354
- "ents_f": 0.7167540283,
 
355
  "ents_per_type": {
356
  "DATETIME": {
357
- "p": 0.7889273356,
358
- "r": 0.7944250871,
359
- "f": 0.7916666667
360
  },
361
  "ORGANIZATION": {
362
- "p": 0.6656050955,
363
- "r": 0.6656050955,
364
- "f": 0.6656050955
365
  },
366
  "FACILITY": {
367
- "p": 0.4953271028,
368
  "r": 0.4045801527,
369
- "f": 0.4453781513
370
  },
371
  "NUMERIC_VALUE": {
372
- "p": 0.875,
373
- "r": 0.9194915254,
374
- "f": 0.8966942149
375
  },
376
  "ORDINAL": {
377
- "p": 0.7962962963,
378
- "r": 0.7818181818,
379
- "f": 0.7889908257
380
  },
381
  "EVENT": {
382
- "p": 0.6551724138,
383
- "r": 0.5135135135,
384
- "f": 0.5757575758
385
  },
386
  "GPE": {
387
- "p": 0.8025477707,
388
- "r": 0.8689655172,
389
- "f": 0.8344370861
390
  },
391
  "PERSON": {
392
- "p": 0.675862069,
393
- "r": 0.6577181208,
394
- "f": 0.6666666667
395
  },
396
  "NAT_REL_POL": {
397
- "p": 0.8976377953,
398
- "r": 0.76,
399
- "f": 0.8231046931
400
  },
401
  "MONEY": {
402
- "p": 0.8867924528,
403
- "r": 0.8103448276,
404
- "f": 0.8468468468
405
  },
406
  "PRODUCT": {
407
- "p": 0.4424778761,
408
- "r": 0.3649635036,
409
- "f": 0.4
410
  },
411
  "LOC": {
412
- "p": 0.4461538462,
413
- "r": 0.3815789474,
414
- "f": 0.4113475177
415
  },
416
  "WORK_OF_ART": {
417
- "p": 0.24,
418
- "r": 0.3157894737,
419
- "f": 0.2727272727
420
  },
421
  "QUANTITY": {
422
- "p": 0.8518518519,
423
  "r": 0.8846153846,
424
- "f": 0.8679245283
425
  },
426
  "LANGUAGE": {
427
- "p": 0.5714285714,
428
- "r": 1.0,
429
- "f": 0.7272727273
430
  },
431
  "PERIOD": {
432
- "p": 0.8717948718,
433
- "r": 0.8095238095,
434
- "f": 0.8395061728
435
  }
436
- }
 
437
  }
 
1
  {
2
  "token_acc": 0.9990029326,
3
+ "token_p": 0.9967350492,
4
+ "token_r": 0.9957244934,
5
+ "token_f": 0.9959492157,
6
+ "tag_acc": 0.9540563521,
7
+ "sents_p": 0.9679572764,
8
+ "sents_r": 0.9640957447,
9
+ "sents_f": 0.9660226516,
10
+ "dep_uas": 0.8691229961,
11
+ "dep_las": 0.8114296468,
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
  "dep_las_per_type": {
13
  "root": {
14
+ "p": 0.8723021583,
15
+ "r": 0.9133709981,
16
+ "f": 0.8923643054
17
  },
18
  "mark": {
19
+ "p": 0.9135338346,
20
+ "r": 0.9169811321,
21
+ "f": 0.9152542373
22
  },
23
  "case": {
24
+ "p": 0.9534009771,
25
+ "r": 0.951256093,
26
+ "f": 0.9523273273
27
  },
28
  "nmod:tmod": {
29
+ "p": 0.6888888889,
30
+ "r": 0.2605042017,
31
+ "f": 0.3780487805
32
  },
33
  "amod": {
34
+ "p": 0.88,
35
+ "r": 0.8807495741,
36
+ "f": 0.8803746275
37
  },
38
  "nsubj": {
39
+ "p": 0.835,
40
  "r": 0.7914691943,
41
+ "f": 0.8126520681
42
  },
43
  "nmod": {
44
+ "p": 0.7983435362,
45
+ "r": 0.7915030346,
46
+ "f": 0.7949085694
47
  },
48
  "aux": {
49
+ "p": 0.9665427509,
50
+ "r": 0.9506398537,
51
+ "f": 0.9585253456
52
  },
53
  "advcl": {
54
+ "p": 0.5862068966,
55
+ "r": 0.6390977444,
56
+ "f": 0.6115107914
57
  },
58
  "obj": {
59
+ "p": 0.800443459,
60
+ "r": 0.8337182448,
61
+ "f": 0.8167420814
62
  },
63
  "det": {
64
+ "p": 0.9417142857,
65
+ "r": 0.933182333,
66
+ "f": 0.9374288965
67
  },
68
  "cc": {
69
+ "p": 0.9206680585,
70
+ "r": 0.9206680585,
71
+ "f": 0.9206680585
72
  },
73
  "conj": {
74
+ "p": 0.5717948718,
75
+ "r": 0.516801854,
76
+ "f": 0.5429093122
77
  },
78
  "nummod": {
79
+ "p": 0.8808777429,
80
+ "r": 0.8726708075,
81
+ "f": 0.8767550702
82
  },
83
  "acl": {
84
+ "p": 0.7312138728,
85
+ "r": 0.65374677,
86
+ "f": 0.690313779
87
  },
88
  "advmod": {
89
+ "p": 0.774535809,
90
+ "r": 0.7704485488,
91
+ "f": 0.7724867725
92
  },
93
  "obl": {
94
+ "p": 0.6485042735,
95
+ "r": 0.7703045685,
96
+ "f": 0.7041763341
97
  },
98
  "expl:pass": {
99
+ "p": 0.8367346939,
100
+ "r": 0.7592592593,
101
+ "f": 0.7961165049
102
  },
103
  "nsubj:pass": {
104
+ "p": 0.8206896552,
105
+ "r": 0.7256097561,
106
+ "f": 0.7702265372
107
  },
108
  "fixed": {
109
+ "p": 0.867803838,
110
+ "r": 0.8604651163,
111
+ "f": 0.864118896
112
  },
113
  "appos": {
114
+ "p": 0.4136546185,
115
  "r": 0.393129771,
116
+ "f": 0.4031311155
117
  },
118
  "parataxis": {
119
+ "p": 0.2272727273,
120
+ "r": 0.2857142857,
121
+ "f": 0.253164557
122
  },
123
  "aux:pass": {
124
+ "p": 0.9133333333,
125
  "r": 0.9133333333,
126
+ "f": 0.9133333333
127
  },
128
  "nmod:agent": {
129
+ "p": 0.0,
130
+ "r": 0.0,
131
+ "f": 0.0
132
  },
133
  "ccomp": {
134
+ "p": 0.8548387097,
135
+ "r": 0.8217054264,
136
+ "f": 0.837944664
137
  },
138
  "nmod:pmod": {
139
+ "p": 0.0,
140
+ "r": 0.0,
141
+ "f": 0.0
142
  },
143
  "iobj": {
144
+ "p": 0.6842105263,
145
+ "r": 0.6419753086,
146
+ "f": 0.6624203822
147
  },
148
  "flat": {
149
+ "p": 0.7493333333,
150
+ "r": 0.7394736842,
151
+ "f": 0.7443708609
152
  },
153
  "cop": {
154
+ "p": 0.8034188034,
155
+ "r": 0.7580645161,
156
+ "f": 0.7800829876
157
  },
158
  "csubj": {
159
+ "p": 1.0,
160
+ "r": 0.8095238095,
161
+ "f": 0.8947368421
162
  },
163
  "dep": {
164
  "p": 0.0,
165
  "r": 0.0,
166
  "f": 0.0
167
  },
168
+ "obl:agent": {
169
+ "p": 0.0,
170
+ "r": 0.0,
171
+ "f": 0.0
172
+ },
173
+ "xcomp": {
174
+ "p": 0.3658536585,
175
+ "r": 0.5555555556,
176
+ "f": 0.4411764706
177
+ },
178
+ "obl:pmod": {
179
+ "p": 0.0,
180
+ "r": 0.0,
181
+ "f": 0.0
182
+ },
183
  "expl:pv": {
184
+ "p": 0.7808219178,
185
+ "r": 0.8260869565,
186
+ "f": 0.8028169014
187
  },
188
  "expl": {
189
+ "p": 0.696969697,
190
+ "r": 0.8518518519,
191
+ "f": 0.7666666667
192
  },
193
  "vocative": {
194
  "p": 0.0,
 
196
  "f": 0.0
197
  },
198
  "expl:poss": {
199
+ "p": 1.0,
200
+ "r": 0.9677419355,
201
+ "f": 0.9836065574
202
  },
203
  "goeswith": {
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
204
  "p": 0.0,
205
  "r": 0.0,
206
  "f": 0.0
207
  },
208
+ "ccomp:pmod": {
209
+ "p": 0.0909090909,
210
+ "r": 0.3333333333,
211
+ "f": 0.1428571429
212
+ },
213
  "orphan": {
214
  "p": 0.0,
215
  "r": 0.0,
216
  "f": 0.0
217
  },
218
  "expl:impers": {
219
+ "p": 0.0,
220
+ "r": 0.0,
221
+ "f": 0.0
222
+ },
223
+ "compound": {
224
+ "p": 0.75,
225
+ "r": 0.4285714286,
226
+ "f": 0.5454545455
227
  },
228
  "list": {
229
  "p": 0.0,
230
  "r": 0.0,
231
  "f": 0.0
232
  },
233
+ "csubj:pass": {
234
+ "p": 0.0,
235
+ "r": 0.0,
236
+ "f": 0.0
237
+ },
238
  "cc:preconj": {
239
  "p": 0.0,
240
  "r": 0.0,
241
  "f": 0.0
242
+ }
243
+ },
244
+ "pos_acc": 0.9297553275,
245
+ "morph_acc": 0.9386929369,
246
+ "morph_micro_p": 0.9832628288,
247
+ "morph_micro_r": 0.9507350539,
248
+ "morph_micro_f": 0.9636742939,
249
+ "morph_per_feat": {
250
+ "Case": {
251
+ "p": 0.9877237852,
252
+ "r": 0.9823222689,
253
+ "f": 0.985015622
254
  },
255
+ "Gender": {
256
+ "p": 0.9819398838,
257
+ "r": 0.9719964996,
258
+ "f": 0.9769428912
259
+ },
260
+ "Number": {
261
+ "p": 0.9836679143,
262
+ "r": 0.9136205625,
263
+ "f": 0.9473511742
264
+ },
265
+ "Person": {
266
+ "p": 0.982373678,
267
+ "r": 0.9852681202,
268
+ "f": 0.9838187702
269
+ },
270
+ "PronType": {
271
+ "p": 0.9937629938,
272
+ "r": 0.991015895,
273
+ "f": 0.9923875433
274
+ },
275
+ "Polarity": {
276
+ "p": 0.9918433931,
277
+ "r": 0.9967213115,
278
+ "f": 0.9942763696
279
+ },
280
+ "AdpType": {
281
+ "p": 0.999660787,
282
+ "r": 0.9972927242,
283
+ "f": 0.9984753515
284
+ },
285
+ "Definite": {
286
+ "p": 0.9843724827,
287
+ "r": 0.9698412698,
288
+ "f": 0.9770528504
289
+ },
290
+ "Degree": {
291
+ "p": 0.9400413508,
292
+ "r": 0.916051041,
293
+ "f": 0.9278911565
294
+ },
295
+ "VerbForm": {
296
+ "p": 0.9644736842,
297
+ "r": 0.9747340426,
298
+ "f": 0.9695767196
299
+ },
300
+ "Abbr": {
301
+ "p": 0.9754901961,
302
+ "r": 0.8883928571,
303
+ "f": 0.9299065421
304
+ },
305
+ "Poss": {
306
+ "p": 1.0,
307
+ "r": 0.9855421687,
308
+ "f": 0.9927184466
309
+ },
310
+ "NumForm": {
311
+ "p": 1.0,
312
+ "r": 0.3223140496,
313
+ "f": 0.4875
314
+ },
315
+ "NumType": {
316
+ "p": 1.0,
317
+ "r": 0.3241758242,
318
+ "f": 0.489626556
319
+ },
320
+ "Reflex": {
321
+ "p": 1.0,
322
+ "r": 0.9935897436,
323
+ "f": 0.9967845659
324
+ },
325
+ "Strength": {
326
+ "p": 0.9919354839,
327
+ "r": 0.9761904762,
328
+ "f": 0.984
329
+ },
330
+ "Mood": {
331
+ "p": 0.9584086799,
332
+ "r": 0.9724770642,
333
+ "f": 0.9653916211
334
+ },
335
+ "Tense": {
336
+ "p": 0.9596541787,
337
+ "r": 0.9680232558,
338
+ "f": 0.9638205499
339
+ },
340
+ "Variant": {
341
+ "p": 0.9865771812,
342
+ "r": 0.9483870968,
343
+ "f": 0.9671052632
344
+ },
345
+ "Position": {
346
+ "p": 0.9910714286,
347
+ "r": 0.9910714286,
348
+ "f": 0.9910714286
349
+ },
350
+ "Number[psor]": {
351
+ "p": 1.0,
352
+ "r": 0.9,
353
+ "f": 0.9473684211
354
+ },
355
+ "PartType": {
356
+ "p": 0.9722222222,
357
+ "r": 0.9459459459,
358
+ "f": 0.9589041096
359
+ },
360
+ "Foreign": {
361
  "p": 0.0,
362
  "r": 0.0,
363
  "f": 0.0
364
  }
365
  },
366
+ "lemma_acc": 0.8183070924,
367
+ "ents_p": 0.7193937038,
368
+ "ents_r": 0.711102574,
369
+ "ents_f": 0.7152241113,
370
  "ents_per_type": {
371
  "DATETIME": {
372
+ "p": 0.7883959044,
373
+ "r": 0.8048780488,
374
+ "f": 0.7965517241
375
  },
376
  "ORGANIZATION": {
377
+ "p": 0.6325301205,
378
+ "r": 0.6687898089,
379
+ "f": 0.6501547988
380
  },
381
  "FACILITY": {
382
+ "p": 0.4862385321,
383
  "r": 0.4045801527,
384
+ "f": 0.4416666667
385
  },
386
  "NUMERIC_VALUE": {
387
+ "p": 0.9227467811,
388
+ "r": 0.9110169492,
389
+ "f": 0.9168443497
390
  },
391
  "ORDINAL": {
392
+ "p": 0.7857142857,
393
+ "r": 0.8,
394
+ "f": 0.7927927928
395
  },
396
  "EVENT": {
397
+ "p": 0.5294117647,
398
+ "r": 0.4864864865,
399
+ "f": 0.5070422535
400
  },
401
  "GPE": {
402
+ "p": 0.7953586498,
403
+ "r": 0.8666666667,
404
+ "f": 0.8294829483
405
  },
406
  "PERSON": {
407
+ "p": 0.6728187919,
408
+ "r": 0.6728187919,
409
+ "f": 0.6728187919
410
  },
411
  "NAT_REL_POL": {
412
+ "p": 0.8854961832,
413
+ "r": 0.7733333333,
414
+ "f": 0.8256227758
415
  },
416
  "MONEY": {
417
+ "p": 0.9230769231,
418
+ "r": 0.8275862069,
419
+ "f": 0.8727272727
420
  },
421
  "PRODUCT": {
422
+ "p": 0.427184466,
423
+ "r": 0.3211678832,
424
+ "f": 0.3666666667
425
  },
426
  "LOC": {
427
+ "p": 0.4782608696,
428
+ "r": 0.4342105263,
429
+ "f": 0.4551724138
430
  },
431
  "WORK_OF_ART": {
432
+ "p": 0.2222222222,
433
+ "r": 0.2105263158,
434
+ "f": 0.2162162162
435
  },
436
  "QUANTITY": {
437
+ "p": 0.7419354839,
438
  "r": 0.8846153846,
439
+ "f": 0.8070175439
440
  },
441
  "LANGUAGE": {
442
+ "p": 0.6,
443
+ "r": 0.75,
444
+ "f": 0.6666666667
445
  },
446
  "PERIOD": {
447
+ "p": 0.8378378378,
448
+ "r": 0.7380952381,
449
+ "f": 0.7848101266
450
  }
451
+ },
452
+ "speed": 8496.149813704
453
  }
attribute_ruler/patterns CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
 
config.cfg CHANGED
@@ -1,10 +1,8 @@
1
  [paths]
2
- train = "corpus/ro-dep-mixed/train.spacy"
3
- dev = "corpus/ro-dep-mixed/dev.spacy"
4
  vectors = null
5
- raw = null
6
  init_tok2vec = null
7
- vocab_data = null
8
 
9
  [system]
10
  gpu_allocator = null
@@ -24,6 +22,7 @@ tokenizer = {"@tokenizers":"spacy.Tokenizer.v1"}
24
 
25
  [components.attribute_ruler]
26
  factory = "attribute_ruler"
 
27
  validate = false
28
 
29
  [components.lemmatizer]
@@ -31,11 +30,13 @@ factory = "lemmatizer"
31
  mode = "lookup"
32
  model = null
33
  overwrite = false
 
34
 
35
  [components.ner]
36
  factory = "ner"
37
  incorrect_spans_key = null
38
  moves = null
 
39
  update_with_oracle_cut_size = 100
40
 
41
  [components.ner.model]
@@ -53,8 +54,8 @@ nO = null
53
  [components.ner.model.tok2vec.embed]
54
  @architectures = "spacy.MultiHashEmbed.v2"
55
  width = 96
56
- attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
57
- rows = [5000,2500,2500,2500]
58
  include_static_vectors = false
59
 
60
  [components.ner.model.tok2vec.encode]
@@ -69,6 +70,7 @@ factory = "parser"
69
  learn_tokens = false
70
  min_action_freq = 30
71
  moves = null
 
72
  update_with_oracle_cut_size = 100
73
 
74
  [components.parser.model]
@@ -87,6 +89,8 @@ upstream = "tok2vec"
87
 
88
  [components.senter]
89
  factory = "senter"
 
 
90
 
91
  [components.senter.model]
92
  @architectures = "spacy.Tagger.v1"
@@ -98,8 +102,8 @@ nO = null
98
  [components.senter.model.tok2vec.embed]
99
  @architectures = "spacy.MultiHashEmbed.v2"
100
  width = 16
101
- attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
102
- rows = [1000,500,500,500]
103
  include_static_vectors = false
104
 
105
  [components.senter.model.tok2vec.encode]
@@ -111,6 +115,8 @@ maxout_pieces = 2
111
 
112
  [components.tagger]
113
  factory = "tagger"
 
 
114
 
115
  [components.tagger.model]
116
  @architectures = "spacy.Tagger.v1"
@@ -130,8 +136,8 @@ factory = "tok2vec"
130
  [components.tok2vec.model.embed]
131
  @architectures = "spacy.MultiHashEmbed.v2"
132
  width = ${components.tok2vec.model.encode:width}
133
- attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
134
- rows = [5000,2500,2500,2500]
135
  include_static_vectors = false
136
 
137
  [components.tok2vec.model.encode]
@@ -145,22 +151,19 @@ maxout_pieces = 3
145
 
146
  [corpora.dev]
147
  @readers = "spacy.Corpus.v1"
148
- limit = 0
149
- max_length = 0
150
- path = ${paths:dev}
151
  gold_preproc = false
 
 
152
  augmenter = null
153
 
154
  [corpora.train]
155
  @readers = "spacy.Corpus.v1"
156
- path = ${paths:train}
157
- max_length = 5000
158
  gold_preproc = false
 
159
  limit = 0
160
-
161
- [corpora.train.augmenter]
162
- @augmenters = "spacy.lower_case.v1"
163
- level = 0.1
164
 
165
  [training]
166
  train_corpus = "corpora.train"
@@ -191,9 +194,8 @@ compound = 1.001
191
  t = 0.0
192
 
193
  [training.logger]
194
- @loggers = "spacy.WandbLogger.v1"
195
- project_name = "spacy-v3.0.0a2"
196
- remove_config_values = []
197
 
198
  [training.optimizer]
199
  @optimizers = "Adam.v1"
@@ -214,16 +216,17 @@ dep_las_per_type = null
214
  sents_p = null
215
  sents_r = null
216
  sents_f = 0.02
217
- lemma_acc = 0.33
218
- ents_f = 0.33
219
  ents_p = 0.0
220
  ents_r = 0.0
221
  ents_per_type = null
 
222
 
223
  [pretraining]
224
 
225
  [initialize]
226
- vocab_data = ${paths.vocab_data}
227
  vectors = ${paths.vectors}
228
  init_tok2vec = ${paths.init_tok2vec}
229
  before_init = null
 
1
  [paths]
2
+ train = null
3
+ dev = null
4
  vectors = null
 
5
  init_tok2vec = null
 
6
 
7
  [system]
8
  gpu_allocator = null
 
22
 
23
  [components.attribute_ruler]
24
  factory = "attribute_ruler"
25
+ scorer = {"@scorers":"spacy.attribute_ruler_scorer.v1"}
26
  validate = false
27
 
28
  [components.lemmatizer]
 
30
  mode = "lookup"
31
  model = null
32
  overwrite = false
33
+ scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
34
 
35
  [components.ner]
36
  factory = "ner"
37
  incorrect_spans_key = null
38
  moves = null
39
+ scorer = {"@scorers":"spacy.ner_scorer.v1"}
40
  update_with_oracle_cut_size = 100
41
 
42
  [components.ner.model]
 
54
  [components.ner.model.tok2vec.embed]
55
  @architectures = "spacy.MultiHashEmbed.v2"
56
  width = 96
57
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
58
+ rows = [5000,2500,2500,2500,100]
59
  include_static_vectors = false
60
 
61
  [components.ner.model.tok2vec.encode]
 
70
  learn_tokens = false
71
  min_action_freq = 30
72
  moves = null
73
+ scorer = {"@scorers":"spacy.parser_scorer.v1"}
74
  update_with_oracle_cut_size = 100
75
 
76
  [components.parser.model]
 
89
 
90
  [components.senter]
91
  factory = "senter"
92
+ overwrite = false
93
+ scorer = {"@scorers":"spacy.senter_scorer.v1"}
94
 
95
  [components.senter.model]
96
  @architectures = "spacy.Tagger.v1"
 
102
  [components.senter.model.tok2vec.embed]
103
  @architectures = "spacy.MultiHashEmbed.v2"
104
  width = 16
105
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
106
+ rows = [1000,500,500,500,50]
107
  include_static_vectors = false
108
 
109
  [components.senter.model.tok2vec.encode]
 
115
 
116
  [components.tagger]
117
  factory = "tagger"
118
+ overwrite = false
119
+ scorer = {"@scorers":"spacy.tagger_scorer.v1"}
120
 
121
  [components.tagger.model]
122
  @architectures = "spacy.Tagger.v1"
 
136
  [components.tok2vec.model.embed]
137
  @architectures = "spacy.MultiHashEmbed.v2"
138
  width = ${components.tok2vec.model.encode:width}
139
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
140
+ rows = [5000,2500,2500,2500,100]
141
  include_static_vectors = false
142
 
143
  [components.tok2vec.model.encode]
 
151
 
152
  [corpora.dev]
153
  @readers = "spacy.Corpus.v1"
154
+ path = ${paths.dev}
 
 
155
  gold_preproc = false
156
+ max_length = 0
157
+ limit = 0
158
  augmenter = null
159
 
160
  [corpora.train]
161
  @readers = "spacy.Corpus.v1"
162
+ path = ${paths.train}
 
163
  gold_preproc = false
164
+ max_length = 0
165
  limit = 0
166
+ augmenter = null
 
 
 
167
 
168
  [training]
169
  train_corpus = "corpora.train"
 
194
  t = 0.0
195
 
196
  [training.logger]
197
+ @loggers = "spacy.ConsoleLogger.v1"
198
+ progress_bar = false
 
199
 
200
  [training.optimizer]
201
  @optimizers = "Adam.v1"
 
216
  sents_p = null
217
  sents_r = null
218
  sents_f = 0.02
219
+ lemma_acc = 0.5
220
+ ents_f = 0.16
221
  ents_p = 0.0
222
  ents_r = 0.0
223
  ents_per_type = null
224
+ speed = 0.0
225
 
226
  [pretraining]
227
 
228
  [initialize]
229
+ vocab_data = null
230
  vectors = ${paths.vectors}
231
  init_tok2vec = ${paths.init_tok2vec}
232
  before_init = null
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"ro",
3
  "name":"core_news_sm",
4
- "version":"3.1.0",
5
  "description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"[email protected]",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
- "spacy_version":">=3.1.0,<3.2.0",
11
- "spacy_git_version":"caba63b74",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
@@ -30,6 +30,7 @@
30
  "Afp",
31
  "Afp-p-n",
32
  "Afp-poy",
 
33
  "Afpf--n",
34
  "Afpfp-n",
35
  "Afpfp-ny",
@@ -131,6 +132,7 @@
131
  "Ds2ms-s",
132
  "Ds3---p",
133
  "Ds3---s",
 
134
  "Ds3fp-s",
135
  "Ds3fsos",
136
  "Ds3fsrs",
@@ -159,18 +161,23 @@
159
  "LSQR",
160
  "LT",
161
  "M",
162
- "Mc",
163
  "Mc-p-d",
164
  "Mc-p-l",
 
 
 
165
  "Mcfp-l",
166
  "Mcfp-ln",
167
  "Mcfprln",
168
  "Mcfprly",
169
  "Mcfsoln",
 
170
  "Mcfsrln",
 
171
  "Mcmp-l",
172
  "Mcms-ln",
173
  "Mcmsrl",
 
174
  "Mcmsrly",
175
  "Mffprln",
176
  "Mffsrln",
@@ -243,7 +250,6 @@
243
  "Pd3mpr--y",
244
  "Pd3mso",
245
  "Pd3msr",
246
- "Pi3",
247
  "Pi3--r",
248
  "Pi3-po",
249
  "Pi3-so",
@@ -289,6 +295,7 @@
289
  "Pp3-po--------s",
290
  "Pp3-sd--------w",
291
  "Pp3-sd--y-----w",
 
292
  "Pp3fpa--------w",
293
  "Pp3fpa--y-----w",
294
  "Pp3fpr--------s",
@@ -315,7 +322,6 @@
315
  "Ps2fp-s",
316
  "Ps2fsrp",
317
  "Ps2fsrs",
318
- "Ps2ms-s",
319
  "Ps3---p",
320
  "Ps3---s",
321
  "Ps3fp-s",
@@ -348,7 +354,6 @@
348
  "RPAR",
349
  "RSQR",
350
  "Rc",
351
- "Rgc",
352
  "Rgp",
353
  "Rgpy",
354
  "Rgs",
@@ -406,6 +411,7 @@
406
  "Va--3s",
407
  "Va--3s----y",
408
  "Vag",
 
409
  "Vaii1",
410
  "Vaii2s",
411
  "Vaii3p",
@@ -475,7 +481,7 @@
475
  "Vmp--sm",
476
  "Vmp--sm---y",
477
  "Vmsp1p",
478
- "Vmsp1s",
479
  "Vmsp2s",
480
  "Vmsp3",
481
  "Vmsp3-----y",
@@ -488,6 +494,7 @@
488
  "Ynmsoy",
489
  "Ynmsry",
490
  "Yp",
 
491
  "Yp-sr",
492
  "Yr"
493
  ],
@@ -525,14 +532,14 @@
525
  "iobj",
526
  "mark",
527
  "nmod",
528
- "nmod:agent",
529
- "nmod:pmod",
530
  "nmod:tmod",
531
  "nsubj",
532
  "nsubj:pass",
533
  "nummod",
534
  "obj",
535
  "obl",
 
 
536
  "orphan",
537
  "parataxis",
538
  "punct",
@@ -590,298 +597,195 @@
590
  ],
591
  "performance":{
592
  "token_acc":0.9990029326,
593
- "tag_acc":0.9612171026,
594
- "pos_acc":0.9556105925,
595
- "morph_acc":0.9623560794,
596
- "lemma_acc":0.8186589263,
597
- "dep_uas":0.8705660969,
598
- "dep_las":0.8189173932,
599
- "sents_p":0.950928382,
600
- "sents_r":0.9534574468,
601
- "sents_f":0.9521912351,
602
- "speed":9573.3292142848,
603
- "morph_per_feat":{
604
- "Case":{
605
- "p":0.985836417,
606
- "r":0.9825766247,
607
- "f":0.9842038217
608
- },
609
- "Gender":{
610
- "p":0.9825625474,
611
- "r":0.9721215152,
612
- "f":0.9773141457
613
- },
614
- "Number":{
615
- "p":0.9856352415,
616
- "r":0.9757716212,
617
- "f":0.98067863
618
- },
619
- "Person":{
620
- "p":0.9789227166,
621
- "r":0.9852681202,
622
- "f":0.9820851689
623
- },
624
- "PronType":{
625
- "p":0.9951523546,
626
- "r":0.99308915,
627
- "f":0.9941196818
628
- },
629
- "Polarity":{
630
- "p":0.9918166939,
631
- "r":0.993442623,
632
- "f":0.9926289926
633
- },
634
- "AdpType":{
635
- "p":0.9976311337,
636
- "r":0.9976311337,
637
- "f":0.9976311337
638
- },
639
- "Definite":{
640
- "p":0.9854908915,
641
- "r":0.9703174603,
642
- "f":0.9778453171
643
- },
644
- "Degree":{
645
- "p":0.9436038514,
646
- "r":0.9214237743,
647
- "f":0.9323819232
648
- },
649
- "VerbForm":{
650
- "p":0.9632545932,
651
- "r":0.9760638298,
652
- "f":0.9696169089
653
- },
654
- "Abbr":{
655
- "p":0.9804878049,
656
- "r":0.8973214286,
657
- "f":0.9370629371
658
- },
659
- "Poss":{
660
- "p":1.0,
661
- "r":0.9951807229,
662
- "f":0.9975845411
663
- },
664
- "NumForm":{
665
- "p":0.9944134078,
666
- "r":0.9807162534,
667
- "f":0.987517337
668
- },
669
- "NumType":{
670
- "p":0.995821727,
671
- "r":0.9821428571,
672
- "f":0.9889349931
673
- },
674
- "Reflex":{
675
- "p":1.0,
676
- "r":0.9935897436,
677
- "f":0.9967845659
678
- },
679
- "Strength":{
680
- "p":0.9841269841,
681
- "r":0.9841269841,
682
- "f":0.9841269841
683
- },
684
- "Mood":{
685
- "p":0.9446428571,
686
- "r":0.9706422018,
687
- "f":0.9574660633
688
- },
689
- "Tense":{
690
- "p":0.9470672389,
691
- "r":0.9622093023,
692
- "f":0.9545782264
693
- },
694
- "Variant":{
695
- "p":0.9867549669,
696
- "r":0.9612903226,
697
- "f":0.9738562092
698
- },
699
- "Position":{
700
- "p":1.0,
701
- "r":0.9910714286,
702
- "f":0.9955156951
703
- },
704
- "Number[psor]":{
705
- "p":1.0,
706
- "r":0.9666666667,
707
- "f":0.9830508475
708
- },
709
- "PartType":{
710
- "p":1.0,
711
- "r":0.9459459459,
712
- "f":0.9722222222
713
- },
714
- "Foreign":{
715
- "p":0.0,
716
- "r":0.0,
717
- "f":0.0
718
- }
719
- },
720
  "dep_las_per_type":{
721
  "root":{
722
- "p":0.8519855596,
723
- "r":0.8888888889,
724
- "f":0.8700460829
725
  },
726
  "mark":{
727
- "p":0.9138576779,
728
- "r":0.920754717,
729
- "f":0.9172932331
730
  },
731
  "case":{
732
- "p":0.9520392749,
733
- "r":0.9452568429,
734
- "f":0.948635936
735
  },
736
  "nmod:tmod":{
737
- "p":0.5151515152,
738
- "r":0.1428571429,
739
- "f":0.2236842105
740
  },
741
  "amod":{
742
- "p":0.8744625967,
743
- "r":0.8662691652,
744
- "f":0.8703465982
745
  },
746
  "nsubj":{
747
- "p":0.8336106489,
748
  "r":0.7914691943,
749
- "f":0.811993517
750
  },
751
  "nmod":{
752
- "p":0.8005068791,
753
- "r":0.7893609425,
754
- "f":0.7948948409
755
  },
756
  "aux":{
757
- "p":0.9757462687,
758
- "r":0.9561243144,
759
- "f":0.9658356417
760
  },
761
  "advcl":{
762
- "p":0.5620915033,
763
- "r":0.6466165414,
764
- "f":0.6013986014
765
  },
766
  "obj":{
767
- "p":0.7931769723,
768
- "r":0.8591224018,
769
- "f":0.8248337029
770
  },
771
  "det":{
772
- "p":0.9598623853,
773
- "r":0.9479048698,
774
- "f":0.9538461538
775
  },
776
  "cc":{
777
- "p":0.9182389937,
778
- "r":0.9144050104,
779
- "f":0.9163179916
780
  },
781
  "conj":{
782
- "p":0.5696969697,
783
- "r":0.5446118192,
784
- "f":0.5568720379
785
  },
786
  "nummod":{
787
- "p":0.8888888889,
788
- "r":0.8571428571,
789
- "f":0.8727272727
790
  },
791
  "acl":{
792
- "p":0.7819767442,
793
- "r":0.6950904393,
794
- "f":0.7359781122
795
  },
796
  "advmod":{
797
- "p":0.7928388747,
798
- "r":0.8179419525,
799
- "f":0.8051948052
800
  },
801
  "obl":{
802
- "p":0.6563500534,
803
- "r":0.7804568528,
804
- "f":0.7130434783
805
  },
806
  "expl:pass":{
807
- "p":0.8372093023,
808
- "r":0.6666666667,
809
- "f":0.7422680412
810
  },
811
  "nsubj:pass":{
812
- "p":0.8582089552,
813
- "r":0.7012195122,
814
- "f":0.7718120805
815
  },
816
  "fixed":{
817
- "p":0.8381742739,
818
- "r":0.8541226216,
819
- "f":0.8460732984
820
  },
821
  "appos":{
822
- "p":0.4291666667,
823
  "r":0.393129771,
824
- "f":0.4103585657
825
  },
826
  "parataxis":{
827
- "p":0.2045454545,
828
- "r":0.2571428571,
829
- "f":0.2278481013
830
  },
831
  "aux:pass":{
832
- "p":0.9256756757,
833
  "r":0.9133333333,
834
- "f":0.9194630872
835
  },
836
  "nmod:agent":{
837
- "p":0.9462365591,
838
- "r":0.7927927928,
839
- "f":0.862745098
840
  },
841
  "ccomp":{
842
- "p":0.8770491803,
843
- "r":0.8294573643,
844
- "f":0.8525896414
845
  },
846
  "nmod:pmod":{
847
- "p":0.3555555556,
848
- "r":0.2666666667,
849
- "f":0.3047619048
850
  },
851
  "iobj":{
852
- "p":0.7619047619,
853
- "r":0.7901234568,
854
- "f":0.7757575758
855
  },
856
  "flat":{
857
- "p":0.7560321716,
858
- "r":0.7421052632,
859
- "f":0.7490039841
860
  },
861
  "cop":{
862
- "p":0.7952755906,
863
- "r":0.814516129,
864
- "f":0.8047808765
865
  },
866
  "csubj":{
867
- "p":0.7619047619,
868
- "r":0.7619047619,
869
- "f":0.7619047619
870
  },
871
  "dep":{
872
  "p":0.0,
873
  "r":0.0,
874
  "f":0.0
875
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
876
  "expl:pv":{
877
- "p":0.7532467532,
878
- "r":0.8405797101,
879
- "f":0.7945205479
880
  },
881
  "expl":{
882
- "p":0.7037037037,
883
- "r":0.7037037037,
884
- "f":0.7037037037
885
  },
886
  "vocative":{
887
  "p":0.0,
@@ -889,141 +793,260 @@
889
  "f":0.0
890
  },
891
  "expl:poss":{
892
- "p":0.9310344828,
893
- "r":0.8709677419,
894
- "f":0.9
895
  },
896
  "goeswith":{
897
- "p":0.1,
898
- "r":0.3333333333,
899
- "f":0.1538461538
900
- },
901
- "xcomp":{
902
- "p":0.325,
903
- "r":0.4814814815,
904
- "f":0.3880597015
905
- },
906
- "compound":{
907
- "p":0.3333333333,
908
- "r":0.5714285714,
909
- "f":0.4210526316
910
- },
911
- "ccomp:pmod":{
912
  "p":0.0,
913
  "r":0.0,
914
  "f":0.0
915
  },
 
 
 
 
 
916
  "orphan":{
917
  "p":0.0,
918
  "r":0.0,
919
  "f":0.0
920
  },
921
  "expl:impers":{
922
- "p":0.5,
923
- "r":0.3333333333,
924
- "f":0.4
 
 
 
 
 
925
  },
926
  "list":{
927
  "p":0.0,
928
  "r":0.0,
929
  "f":0.0
930
  },
 
 
 
 
 
931
  "cc:preconj":{
932
  "p":0.0,
933
  "r":0.0,
934
  "f":0.0
 
 
 
 
 
 
 
 
 
 
 
 
935
  },
936
- "csubj:pass":{
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
937
  "p":0.0,
938
  "r":0.0,
939
  "f":0.0
940
  }
941
  },
942
- "ents_p":0.7244897959,
943
- "ents_r":0.7091817134,
944
- "ents_f":0.7167540283,
 
945
  "ents_per_type":{
946
  "DATETIME":{
947
- "p":0.7889273356,
948
- "r":0.7944250871,
949
- "f":0.7916666667
950
  },
951
  "ORGANIZATION":{
952
- "p":0.6656050955,
953
- "r":0.6656050955,
954
- "f":0.6656050955
955
  },
956
  "FACILITY":{
957
- "p":0.4953271028,
958
  "r":0.4045801527,
959
- "f":0.4453781513
960
  },
961
  "NUMERIC_VALUE":{
962
- "p":0.875,
963
- "r":0.9194915254,
964
- "f":0.8966942149
965
  },
966
  "ORDINAL":{
967
- "p":0.7962962963,
968
- "r":0.7818181818,
969
- "f":0.7889908257
970
  },
971
  "EVENT":{
972
- "p":0.6551724138,
973
- "r":0.5135135135,
974
- "f":0.5757575758
975
  },
976
  "GPE":{
977
- "p":0.8025477707,
978
- "r":0.8689655172,
979
- "f":0.8344370861
980
  },
981
  "PERSON":{
982
- "p":0.675862069,
983
- "r":0.6577181208,
984
- "f":0.6666666667
985
  },
986
  "NAT_REL_POL":{
987
- "p":0.8976377953,
988
- "r":0.76,
989
- "f":0.8231046931
990
  },
991
  "MONEY":{
992
- "p":0.8867924528,
993
- "r":0.8103448276,
994
- "f":0.8468468468
995
  },
996
  "PRODUCT":{
997
- "p":0.4424778761,
998
- "r":0.3649635036,
999
- "f":0.4
1000
  },
1001
  "LOC":{
1002
- "p":0.4461538462,
1003
- "r":0.3815789474,
1004
- "f":0.4113475177
1005
  },
1006
  "WORK_OF_ART":{
1007
- "p":0.24,
1008
- "r":0.3157894737,
1009
- "f":0.2727272727
1010
  },
1011
  "QUANTITY":{
1012
- "p":0.8518518519,
1013
  "r":0.8846153846,
1014
- "f":0.8679245283
1015
  },
1016
  "LANGUAGE":{
1017
- "p":0.5714285714,
1018
- "r":1.0,
1019
- "f":0.7272727273
1020
  },
1021
  "PERIOD":{
1022
- "p":0.8717948718,
1023
- "r":0.8095238095,
1024
- "f":0.8395061728
1025
  }
1026
- }
 
1027
  },
1028
  "sources":[
1029
  {
@@ -1033,7 +1056,7 @@
1033
  "author":"Michal M\u011bchura"
1034
  },
1035
  {
1036
- "name":"UD Romanian RRT v2.5",
1037
  "url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
1038
  "license":"CC BY-SA 4.0",
1039
  "author":"Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin"
 
1
  {
2
  "lang":"ro",
3
  "name":"core_news_sm",
4
+ "version":"3.2.0",
5
  "description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"[email protected]",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
+ "spacy_version":">=3.2.0,<3.3.0",
11
+ "spacy_git_version":"bb26550e2",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
 
30
  "Afp",
31
  "Afp-p-n",
32
  "Afp-poy",
33
+ "Afp-srn",
34
  "Afpf--n",
35
  "Afpfp-n",
36
  "Afpfp-ny",
 
132
  "Ds2ms-s",
133
  "Ds3---p",
134
  "Ds3---s",
135
+ "Ds3---sy",
136
  "Ds3fp-s",
137
  "Ds3fsos",
138
  "Ds3fsrs",
 
161
  "LSQR",
162
  "LT",
163
  "M",
 
164
  "Mc-p-d",
165
  "Mc-p-l",
166
+ "Mc-s-b",
167
+ "Mc-s-d",
168
+ "Mc-s-l",
169
  "Mcfp-l",
170
  "Mcfp-ln",
171
  "Mcfprln",
172
  "Mcfprly",
173
  "Mcfsoln",
174
+ "Mcfsrl",
175
  "Mcfsrln",
176
+ "Mcfsrly",
177
  "Mcmp-l",
178
  "Mcms-ln",
179
  "Mcmsrl",
180
+ "Mcmsrln",
181
  "Mcmsrly",
182
  "Mffprln",
183
  "Mffsrln",
 
250
  "Pd3mpr--y",
251
  "Pd3mso",
252
  "Pd3msr",
 
253
  "Pi3--r",
254
  "Pi3-po",
255
  "Pi3-so",
 
295
  "Pp3-po--------s",
296
  "Pp3-sd--------w",
297
  "Pp3-sd--y-----w",
298
+ "Pp3-so--------s",
299
  "Pp3fpa--------w",
300
  "Pp3fpa--y-----w",
301
  "Pp3fpr--------s",
 
322
  "Ps2fp-s",
323
  "Ps2fsrp",
324
  "Ps2fsrs",
 
325
  "Ps3---p",
326
  "Ps3---s",
327
  "Ps3fp-s",
 
354
  "RPAR",
355
  "RSQR",
356
  "Rc",
 
357
  "Rgp",
358
  "Rgpy",
359
  "Rgs",
 
411
  "Va--3s",
412
  "Va--3s----y",
413
  "Vag",
414
+ "Vag-------y",
415
  "Vaii1",
416
  "Vaii2s",
417
  "Vaii3p",
 
481
  "Vmp--sm",
482
  "Vmp--sm---y",
483
  "Vmsp1p",
484
+ "Vmsp2p",
485
  "Vmsp2s",
486
  "Vmsp3",
487
  "Vmsp3-----y",
 
494
  "Ynmsoy",
495
  "Ynmsry",
496
  "Yp",
497
+ "Yp,Yn",
498
  "Yp-sr",
499
  "Yr"
500
  ],
 
532
  "iobj",
533
  "mark",
534
  "nmod",
 
 
535
  "nmod:tmod",
536
  "nsubj",
537
  "nsubj:pass",
538
  "nummod",
539
  "obj",
540
  "obl",
541
+ "obl:agent",
542
+ "obl:pmod",
543
  "orphan",
544
  "parataxis",
545
  "punct",
 
597
  ],
598
  "performance":{
599
  "token_acc":0.9990029326,
600
+ "token_p":0.9967350492,
601
+ "token_r":0.9957244934,
602
+ "token_f":0.9959492157,
603
+ "tag_acc":0.9540563521,
604
+ "sents_p":0.9679572764,
605
+ "sents_r":0.9640957447,
606
+ "sents_f":0.9660226516,
607
+ "dep_uas":0.8691229961,
608
+ "dep_las":0.8114296468,
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
609
  "dep_las_per_type":{
610
  "root":{
611
+ "p":0.8723021583,
612
+ "r":0.9133709981,
613
+ "f":0.8923643054
614
  },
615
  "mark":{
616
+ "p":0.9135338346,
617
+ "r":0.9169811321,
618
+ "f":0.9152542373
619
  },
620
  "case":{
621
+ "p":0.9534009771,
622
+ "r":0.951256093,
623
+ "f":0.9523273273
624
  },
625
  "nmod:tmod":{
626
+ "p":0.6888888889,
627
+ "r":0.2605042017,
628
+ "f":0.3780487805
629
  },
630
  "amod":{
631
+ "p":0.88,
632
+ "r":0.8807495741,
633
+ "f":0.8803746275
634
  },
635
  "nsubj":{
636
+ "p":0.835,
637
  "r":0.7914691943,
638
+ "f":0.8126520681
639
  },
640
  "nmod":{
641
+ "p":0.7983435362,
642
+ "r":0.7915030346,
643
+ "f":0.7949085694
644
  },
645
  "aux":{
646
+ "p":0.9665427509,
647
+ "r":0.9506398537,
648
+ "f":0.9585253456
649
  },
650
  "advcl":{
651
+ "p":0.5862068966,
652
+ "r":0.6390977444,
653
+ "f":0.6115107914
654
  },
655
  "obj":{
656
+ "p":0.800443459,
657
+ "r":0.8337182448,
658
+ "f":0.8167420814
659
  },
660
  "det":{
661
+ "p":0.9417142857,
662
+ "r":0.933182333,
663
+ "f":0.9374288965
664
  },
665
  "cc":{
666
+ "p":0.9206680585,
667
+ "r":0.9206680585,
668
+ "f":0.9206680585
669
  },
670
  "conj":{
671
+ "p":0.5717948718,
672
+ "r":0.516801854,
673
+ "f":0.5429093122
674
  },
675
  "nummod":{
676
+ "p":0.8808777429,
677
+ "r":0.8726708075,
678
+ "f":0.8767550702
679
  },
680
  "acl":{
681
+ "p":0.7312138728,
682
+ "r":0.65374677,
683
+ "f":0.690313779
684
  },
685
  "advmod":{
686
+ "p":0.774535809,
687
+ "r":0.7704485488,
688
+ "f":0.7724867725
689
  },
690
  "obl":{
691
+ "p":0.6485042735,
692
+ "r":0.7703045685,
693
+ "f":0.7041763341
694
  },
695
  "expl:pass":{
696
+ "p":0.8367346939,
697
+ "r":0.7592592593,
698
+ "f":0.7961165049
699
  },
700
  "nsubj:pass":{
701
+ "p":0.8206896552,
702
+ "r":0.7256097561,
703
+ "f":0.7702265372
704
  },
705
  "fixed":{
706
+ "p":0.867803838,
707
+ "r":0.8604651163,
708
+ "f":0.864118896
709
  },
710
  "appos":{
711
+ "p":0.4136546185,
712
  "r":0.393129771,
713
+ "f":0.4031311155
714
  },
715
  "parataxis":{
716
+ "p":0.2272727273,
717
+ "r":0.2857142857,
718
+ "f":0.253164557
719
  },
720
  "aux:pass":{
721
+ "p":0.9133333333,
722
  "r":0.9133333333,
723
+ "f":0.9133333333
724
  },
725
  "nmod:agent":{
726
+ "p":0.0,
727
+ "r":0.0,
728
+ "f":0.0
729
  },
730
  "ccomp":{
731
+ "p":0.8548387097,
732
+ "r":0.8217054264,
733
+ "f":0.837944664
734
  },
735
  "nmod:pmod":{
736
+ "p":0.0,
737
+ "r":0.0,
738
+ "f":0.0
739
  },
740
  "iobj":{
741
+ "p":0.6842105263,
742
+ "r":0.6419753086,
743
+ "f":0.6624203822
744
  },
745
  "flat":{
746
+ "p":0.7493333333,
747
+ "r":0.7394736842,
748
+ "f":0.7443708609
749
  },
750
  "cop":{
751
+ "p":0.8034188034,
752
+ "r":0.7580645161,
753
+ "f":0.7800829876
754
  },
755
  "csubj":{
756
+ "p":1.0,
757
+ "r":0.8095238095,
758
+ "f":0.8947368421
759
  },
760
  "dep":{
761
  "p":0.0,
762
  "r":0.0,
763
  "f":0.0
764
  },
765
+ "obl:agent":{
766
+ "p":0.0,
767
+ "r":0.0,
768
+ "f":0.0
769
+ },
770
+ "xcomp":{
771
+ "p":0.3658536585,
772
+ "r":0.5555555556,
773
+ "f":0.4411764706
774
+ },
775
+ "obl:pmod":{
776
+ "p":0.0,
777
+ "r":0.0,
778
+ "f":0.0
779
+ },
780
  "expl:pv":{
781
+ "p":0.7808219178,
782
+ "r":0.8260869565,
783
+ "f":0.8028169014
784
  },
785
  "expl":{
786
+ "p":0.696969697,
787
+ "r":0.8518518519,
788
+ "f":0.7666666667
789
  },
790
  "vocative":{
791
  "p":0.0,
 
793
  "f":0.0
794
  },
795
  "expl:poss":{
796
+ "p":1.0,
797
+ "r":0.9677419355,
798
+ "f":0.9836065574
799
  },
800
  "goeswith":{
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
801
  "p":0.0,
802
  "r":0.0,
803
  "f":0.0
804
  },
805
+ "ccomp:pmod":{
806
+ "p":0.0909090909,
807
+ "r":0.3333333333,
808
+ "f":0.1428571429
809
+ },
810
  "orphan":{
811
  "p":0.0,
812
  "r":0.0,
813
  "f":0.0
814
  },
815
  "expl:impers":{
816
+ "p":0.0,
817
+ "r":0.0,
818
+ "f":0.0
819
+ },
820
+ "compound":{
821
+ "p":0.75,
822
+ "r":0.4285714286,
823
+ "f":0.5454545455
824
  },
825
  "list":{
826
  "p":0.0,
827
  "r":0.0,
828
  "f":0.0
829
  },
830
+ "csubj:pass":{
831
+ "p":0.0,
832
+ "r":0.0,
833
+ "f":0.0
834
+ },
835
  "cc:preconj":{
836
  "p":0.0,
837
  "r":0.0,
838
  "f":0.0
839
+ }
840
+ },
841
+ "pos_acc":0.9297553275,
842
+ "morph_acc":0.9386929369,
843
+ "morph_micro_p":0.9832628288,
844
+ "morph_micro_r":0.9507350539,
845
+ "morph_micro_f":0.9636742939,
846
+ "morph_per_feat":{
847
+ "Case":{
848
+ "p":0.9877237852,
849
+ "r":0.9823222689,
850
+ "f":0.985015622
851
  },
852
+ "Gender":{
853
+ "p":0.9819398838,
854
+ "r":0.9719964996,
855
+ "f":0.9769428912
856
+ },
857
+ "Number":{
858
+ "p":0.9836679143,
859
+ "r":0.9136205625,
860
+ "f":0.9473511742
861
+ },
862
+ "Person":{
863
+ "p":0.982373678,
864
+ "r":0.9852681202,
865
+ "f":0.9838187702
866
+ },
867
+ "PronType":{
868
+ "p":0.9937629938,
869
+ "r":0.991015895,
870
+ "f":0.9923875433
871
+ },
872
+ "Polarity":{
873
+ "p":0.9918433931,
874
+ "r":0.9967213115,
875
+ "f":0.9942763696
876
+ },
877
+ "AdpType":{
878
+ "p":0.999660787,
879
+ "r":0.9972927242,
880
+ "f":0.9984753515
881
+ },
882
+ "Definite":{
883
+ "p":0.9843724827,
884
+ "r":0.9698412698,
885
+ "f":0.9770528504
886
+ },
887
+ "Degree":{
888
+ "p":0.9400413508,
889
+ "r":0.916051041,
890
+ "f":0.9278911565
891
+ },
892
+ "VerbForm":{
893
+ "p":0.9644736842,
894
+ "r":0.9747340426,
895
+ "f":0.9695767196
896
+ },
897
+ "Abbr":{
898
+ "p":0.9754901961,
899
+ "r":0.8883928571,
900
+ "f":0.9299065421
901
+ },
902
+ "Poss":{
903
+ "p":1.0,
904
+ "r":0.9855421687,
905
+ "f":0.9927184466
906
+ },
907
+ "NumForm":{
908
+ "p":1.0,
909
+ "r":0.3223140496,
910
+ "f":0.4875
911
+ },
912
+ "NumType":{
913
+ "p":1.0,
914
+ "r":0.3241758242,
915
+ "f":0.489626556
916
+ },
917
+ "Reflex":{
918
+ "p":1.0,
919
+ "r":0.9935897436,
920
+ "f":0.9967845659
921
+ },
922
+ "Strength":{
923
+ "p":0.9919354839,
924
+ "r":0.9761904762,
925
+ "f":0.984
926
+ },
927
+ "Mood":{
928
+ "p":0.9584086799,
929
+ "r":0.9724770642,
930
+ "f":0.9653916211
931
+ },
932
+ "Tense":{
933
+ "p":0.9596541787,
934
+ "r":0.9680232558,
935
+ "f":0.9638205499
936
+ },
937
+ "Variant":{
938
+ "p":0.9865771812,
939
+ "r":0.9483870968,
940
+ "f":0.9671052632
941
+ },
942
+ "Position":{
943
+ "p":0.9910714286,
944
+ "r":0.9910714286,
945
+ "f":0.9910714286
946
+ },
947
+ "Number[psor]":{
948
+ "p":1.0,
949
+ "r":0.9,
950
+ "f":0.9473684211
951
+ },
952
+ "PartType":{
953
+ "p":0.9722222222,
954
+ "r":0.9459459459,
955
+ "f":0.9589041096
956
+ },
957
+ "Foreign":{
958
  "p":0.0,
959
  "r":0.0,
960
  "f":0.0
961
  }
962
  },
963
+ "lemma_acc":0.8183070924,
964
+ "ents_p":0.7193937038,
965
+ "ents_r":0.711102574,
966
+ "ents_f":0.7152241113,
967
  "ents_per_type":{
968
  "DATETIME":{
969
+ "p":0.7883959044,
970
+ "r":0.8048780488,
971
+ "f":0.7965517241
972
  },
973
  "ORGANIZATION":{
974
+ "p":0.6325301205,
975
+ "r":0.6687898089,
976
+ "f":0.6501547988
977
  },
978
  "FACILITY":{
979
+ "p":0.4862385321,
980
  "r":0.4045801527,
981
+ "f":0.4416666667
982
  },
983
  "NUMERIC_VALUE":{
984
+ "p":0.9227467811,
985
+ "r":0.9110169492,
986
+ "f":0.9168443497
987
  },
988
  "ORDINAL":{
989
+ "p":0.7857142857,
990
+ "r":0.8,
991
+ "f":0.7927927928
992
  },
993
  "EVENT":{
994
+ "p":0.5294117647,
995
+ "r":0.4864864865,
996
+ "f":0.5070422535
997
  },
998
  "GPE":{
999
+ "p":0.7953586498,
1000
+ "r":0.8666666667,
1001
+ "f":0.8294829483
1002
  },
1003
  "PERSON":{
1004
+ "p":0.6728187919,
1005
+ "r":0.6728187919,
1006
+ "f":0.6728187919
1007
  },
1008
  "NAT_REL_POL":{
1009
+ "p":0.8854961832,
1010
+ "r":0.7733333333,
1011
+ "f":0.8256227758
1012
  },
1013
  "MONEY":{
1014
+ "p":0.9230769231,
1015
+ "r":0.8275862069,
1016
+ "f":0.8727272727
1017
  },
1018
  "PRODUCT":{
1019
+ "p":0.427184466,
1020
+ "r":0.3211678832,
1021
+ "f":0.3666666667
1022
  },
1023
  "LOC":{
1024
+ "p":0.4782608696,
1025
+ "r":0.4342105263,
1026
+ "f":0.4551724138
1027
  },
1028
  "WORK_OF_ART":{
1029
+ "p":0.2222222222,
1030
+ "r":0.2105263158,
1031
+ "f":0.2162162162
1032
  },
1033
  "QUANTITY":{
1034
+ "p":0.7419354839,
1035
  "r":0.8846153846,
1036
+ "f":0.8070175439
1037
  },
1038
  "LANGUAGE":{
1039
+ "p":0.6,
1040
+ "r":0.75,
1041
+ "f":0.6666666667
1042
  },
1043
  "PERIOD":{
1044
+ "p":0.8378378378,
1045
+ "r":0.7380952381,
1046
+ "f":0.7848101266
1047
  }
1048
+ },
1049
+ "speed":8496.149813704
1050
  },
1051
  "sources":[
1052
  {
 
1056
  "author":"Michal M\u011bchura"
1057
  },
1058
  {
1059
+ "name":"UD Romanian RRT v2.8",
1060
  "url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
1061
  "license":"CC BY-SA 4.0",
1062
  "author":"Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin"
ner/model CHANGED
Binary files a/ner/model and b/ner/model differ
 
parser/model CHANGED
Binary files a/parser/model and b/parser/model differ
 
parser/moves CHANGED
@@ -1 +1 @@
1
- ��moves�{"0":{"":85972},"1":{"":90580},"2":{"case":22318,"punct":9077,"det":9009,"nsubj":7125,"advmod":6350,"cc":5364,"mark":5291,"aux":4018,"obl":2015,"nummod":1880,"expl:pv":1798,"cop":1706,"amod":1376,"aux:pass":1369,"nsubj:pass":963,"expl:pass":909,"parataxis":877,"obj":866,"advcl":710,"iobj":567,"expl:poss":464,"expl":390,"nmod":204,"nsubj||csubj":154,"nmod:tmod":152,"expl:impers":102,"xcomp":97,"advmod:tmod":85,"nmod:pmod":74,"cc:preconj":63,"csubj":58,"nsubj:pass||csubj":57,"obj||ccomp":44,"orphan":32,"advcl:tcl":30,"dep":0},"3":{"nmod":16696,"punct":14423,"amod":9673,"obl":7745,"conj":7281,"fixed":5595,"obj":5457,"acl":4102,"advmod":2145,"advcl":2043,"ccomp":1929,"nummod":1646,"nsubj":1278,"nmod:pmod":1208,"flat":1160,"det":1031,"appos":915,"xcomp":886,"iobj":804,"nmod:agent":718,"csubj":626,"nsubj:pass":546,"case":442,"parataxis":426,"nmod:tmod":286,"goeswith":245,"ccomp:pmod":174,"cc":124,"cop":100,"expl:pv":86,"expl":55,"advcl:tcl":52,"compound":50,"csubj:pass":49,"expl:poss":36,"vocative":31,"dep":0},"4":{"ROOT":8021}}�cfg��neg_key�
 
1
+ ��moves� {"0":{"":86134},"1":{"":90421},"2":{"case":22293,"punct":9078,"det":9035,"nsubj":7080,"advmod":6417,"mark":5380,"cc":5367,"aux":4002,"obl":2028,"nummod":1887,"expl:pv":1796,"cop":1712,"aux:pass":1372,"amod":1370,"nsubj:pass":1013,"expl:pass":910,"parataxis":878,"obj":868,"advcl":713,"iobj":564,"expl:poss":469,"expl":393,"nmod":203,"nsubj||csubj":155,"nmod:tmod":153,"expl:impers":102,"xcomp":97,"advmod:tmod":84,"obl:pmod":74,"cc:preconj":63,"csubj":59,"nsubj:pass||csubj":57,"obj||ccomp":45,"orphan":32,"advcl:tcl":30,"dep":0},"3":{"nmod":16696,"punct":14500,"amod":9699,"obl":7775,"conj":7286,"fixed":5485,"obj":5462,"acl":4105,"advmod":2099,"advcl":2049,"ccomp":1932,"nummod":1667,"nsubj":1280,"obl:pmod":1208,"flat":1167,"det":1035,"appos":915,"xcomp":891,"iobj":803,"obl:agent":719,"csubj":632,"nsubj:pass":554,"parataxis":435,"case":434,"nmod:tmod":283,"ccomp:pmod":178,"cc":123,"cop":100,"expl:pv":86,"goeswith":72,"expl":55,"compound":52,"advcl:tcl":52,"csubj:pass":49,"expl:poss":35,"vocative":31,"dep":0},"4":{"ROOT":8021}}�cfg��neg_key�
ro_core_news_sm-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:29fcd7c11fa94cfdb875cd8f250c87cb7c720bdc94a02fcedd734ede1a2ee408
3
- size 16363069
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dddd9698256565d7ec89658ea742ff738136afa01a33b81d67b1f2df4519459e
3
+ size 16645399
senter/cfg CHANGED
@@ -1,3 +1,3 @@
1
  {
2
-
3
  }
 
1
  {
2
+ "overwrite":false
3
  }
senter/model CHANGED
Binary files a/senter/model and b/senter/model differ
 
tagger/cfg CHANGED
@@ -10,6 +10,7 @@
10
  "Afp",
11
  "Afp-p-n",
12
  "Afp-poy",
 
13
  "Afpf--n",
14
  "Afpfp-n",
15
  "Afpfp-ny",
@@ -111,6 +112,7 @@
111
  "Ds2ms-s",
112
  "Ds3---p",
113
  "Ds3---s",
 
114
  "Ds3fp-s",
115
  "Ds3fsos",
116
  "Ds3fsrs",
@@ -139,18 +141,23 @@
139
  "LSQR",
140
  "LT",
141
  "M",
142
- "Mc",
143
  "Mc-p-d",
144
  "Mc-p-l",
 
 
 
145
  "Mcfp-l",
146
  "Mcfp-ln",
147
  "Mcfprln",
148
  "Mcfprly",
149
  "Mcfsoln",
 
150
  "Mcfsrln",
 
151
  "Mcmp-l",
152
  "Mcms-ln",
153
  "Mcmsrl",
 
154
  "Mcmsrly",
155
  "Mffprln",
156
  "Mffsrln",
@@ -223,7 +230,6 @@
223
  "Pd3mpr--y",
224
  "Pd3mso",
225
  "Pd3msr",
226
- "Pi3",
227
  "Pi3--r",
228
  "Pi3-po",
229
  "Pi3-so",
@@ -269,6 +275,7 @@
269
  "Pp3-po--------s",
270
  "Pp3-sd--------w",
271
  "Pp3-sd--y-----w",
 
272
  "Pp3fpa--------w",
273
  "Pp3fpa--y-----w",
274
  "Pp3fpr--------s",
@@ -295,7 +302,6 @@
295
  "Ps2fp-s",
296
  "Ps2fsrp",
297
  "Ps2fsrs",
298
- "Ps2ms-s",
299
  "Ps3---p",
300
  "Ps3---s",
301
  "Ps3fp-s",
@@ -328,7 +334,6 @@
328
  "RPAR",
329
  "RSQR",
330
  "Rc",
331
- "Rgc",
332
  "Rgp",
333
  "Rgpy",
334
  "Rgs",
@@ -386,6 +391,7 @@
386
  "Va--3s",
387
  "Va--3s----y",
388
  "Vag",
 
389
  "Vaii1",
390
  "Vaii2s",
391
  "Vaii3p",
@@ -455,7 +461,7 @@
455
  "Vmp--sm",
456
  "Vmp--sm---y",
457
  "Vmsp1p",
458
- "Vmsp1s",
459
  "Vmsp2s",
460
  "Vmsp3",
461
  "Vmsp3-----y",
@@ -468,7 +474,9 @@
468
  "Ynmsoy",
469
  "Ynmsry",
470
  "Yp",
 
471
  "Yp-sr",
472
  "Yr"
473
- ]
 
474
  }
 
10
  "Afp",
11
  "Afp-p-n",
12
  "Afp-poy",
13
+ "Afp-srn",
14
  "Afpf--n",
15
  "Afpfp-n",
16
  "Afpfp-ny",
 
112
  "Ds2ms-s",
113
  "Ds3---p",
114
  "Ds3---s",
115
+ "Ds3---sy",
116
  "Ds3fp-s",
117
  "Ds3fsos",
118
  "Ds3fsrs",
 
141
  "LSQR",
142
  "LT",
143
  "M",
 
144
  "Mc-p-d",
145
  "Mc-p-l",
146
+ "Mc-s-b",
147
+ "Mc-s-d",
148
+ "Mc-s-l",
149
  "Mcfp-l",
150
  "Mcfp-ln",
151
  "Mcfprln",
152
  "Mcfprly",
153
  "Mcfsoln",
154
+ "Mcfsrl",
155
  "Mcfsrln",
156
+ "Mcfsrly",
157
  "Mcmp-l",
158
  "Mcms-ln",
159
  "Mcmsrl",
160
+ "Mcmsrln",
161
  "Mcmsrly",
162
  "Mffprln",
163
  "Mffsrln",
 
230
  "Pd3mpr--y",
231
  "Pd3mso",
232
  "Pd3msr",
 
233
  "Pi3--r",
234
  "Pi3-po",
235
  "Pi3-so",
 
275
  "Pp3-po--------s",
276
  "Pp3-sd--------w",
277
  "Pp3-sd--y-----w",
278
+ "Pp3-so--------s",
279
  "Pp3fpa--------w",
280
  "Pp3fpa--y-----w",
281
  "Pp3fpr--------s",
 
302
  "Ps2fp-s",
303
  "Ps2fsrp",
304
  "Ps2fsrs",
 
305
  "Ps3---p",
306
  "Ps3---s",
307
  "Ps3fp-s",
 
334
  "RPAR",
335
  "RSQR",
336
  "Rc",
 
337
  "Rgp",
338
  "Rgpy",
339
  "Rgs",
 
391
  "Va--3s",
392
  "Va--3s----y",
393
  "Vag",
394
+ "Vag-------y",
395
  "Vaii1",
396
  "Vaii2s",
397
  "Vaii3p",
 
461
  "Vmp--sm",
462
  "Vmp--sm---y",
463
  "Vmsp1p",
464
+ "Vmsp2p",
465
  "Vmsp2s",
466
  "Vmsp3",
467
  "Vmsp3-----y",
 
474
  "Ynmsoy",
475
  "Ynmsry",
476
  "Yp",
477
+ "Yp,Yn",
478
  "Yp-sr",
479
  "Yr"
480
+ ],
481
+ "overwrite":false
482
  }
tagger/model CHANGED
Binary files a/tagger/model and b/tagger/model differ
 
tok2vec/model CHANGED
Binary files a/tok2vec/model and b/tok2vec/model differ
 
tokenizer CHANGED
@@ -1,3 +1,3 @@
1
- ��prefix_search�
2
  ��A�
3
- � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�1-A��A�1-A�1-UL��A�1-UL�1-Ul��A�1-Ul�1-a��A�1-a�1-ul��A�1-ul�10-A��A�10-A�10-LEA��A�10-LEA�10-Lea��A�10-Lea�10-a��A�10-a�10-lea��A�10-lea�11-A��A�11-A�11-LEA��A�11-LEA�11-Lea��A�11-Lea�11-a��A�11-a�11-lea��A�11-lea�12-A��A�12-A�12-LEA��A�12-LEA�12-Lea��A�12-Lea�12-a��A�12-a�12-lea��A�12-lea�2-A��A�2-A�2-LEA��A�2-LEA�2-Lea��A�2-Lea�2-a��A�2-a�2-lea��A�2-lea�3-A��A�3-A�3-LEA��A�3-LEA�3-Lea��A�3-Lea�3-a��A�3-a�3-lea��A�3-lea�4-A��A�4-A�4-LEA��A�4-LEA�4-Lea��A�4-Lea�4-a��A�4-a�4-lea��A�4-lea�5-A��A�5-A�5-LEA��A�5-LEA�5-Lea��A�5-Lea�5-a��A�5-a�5-lea��A�5-lea�6-A��A�6-A�6-LEA��A�6-LEA�6-Lea��A�6-Lea�6-a��A�6-a�6-lea��A�6-lea�7-A��A�7-A�7-LEA��A�7-LEA�7-Lea��A�7-Lea�7-a��A�7-a�7-lea��A�7-lea�8)��A�8)�8-)��A�8-)�8-A��A�8-A�8-D��A�8-D�8-LEA��A�8-LEA�8-Lea��A�8-Lea�8-a��A�8-a�8-lea��A�8-lea�8D��A�8D�9-A��A�9-A�9-LEA��A�9-LEA�9-Lea��A�9-Lea�9-a��A�9-a�9-lea��A�9-lea�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�A.C.��A�A.C.�A.F.��A�A.F.�A.M.��A�A.M.�A.R.��A�A.R.�AL.��A�AL.�ALIN.��A�ALIN.�ART.��A�ART.�AUG.��A�AUG.�Al.��A�Al.�Alin.��A�Alin.�Art.��A�Art.�Aug.��A�Aug.�BD.��A�BD.�Bd.��A�Bd.�C++��A�C++�D-L��A�D-L�D-LUI��A�D-LUI�D-Lui��A�D-Lui�D-NEI��A�D-NEI�D-Nei��A�D-Nei�D-VOASTRA��A�D-VOASTRA�D-VOASTRĂ��A�D-VOASTRĂ�D-Voastra��A�D-Voastra�D-Voastră��A�D-Voastră�D.P.D.V.��A�D.P.D.V.�DEM.��A�DEM.�DPDV��A�DPDV�DR.��A�DR.�DVS.��A�DVS.�Dem.��A�Dem.�Dpdv��A�Dpdv�Dr.��A�Dr.�Dvs.��A�Dvs.�ETC.��A�ETC.�EX.��A�EX.�Etc.��A�Etc.�Ex.��A�Ex.�FIG.��A�FIG.�FR.��A�FR.�Fig.��A�Fig.�Fr.��A�Fr.�GH.��A�GH.�GR.��A�GR.�Gh.��A�Gh.�Gr.��A�Gr.�IAN.��A�IAN.�ING.��A�ING.�INGR.��A�INGR.�INTR-ADEVAR��A�INTR-ADEVAR�INTR-ADEVĂR��A�INTR-ADEVĂR�Ian.��A�Ian.�Ing.��A�Ing.�Ingr.��A�Ingr.�Intr-Adevar��A�Intr-Adevar�Intr-Adevăr��A�Intr-Adevăr�LIT.��A�LIT.�LT.��A�LT.�Lit.��A�Lit.�Lt.��A�Lt.�NR.��A�NR.�Nr.��A�Nr.�O.O��A�O.O�O.o��A�O.o�OBS.��A�OBS.�O_O��A�O_O�O_o��A�O_o�Obs.��A�Obs.�P.A.��A�P.A.�P.M.��A�P.M.�PCT.��A�PCT.�PREP.��A�PREP.�PROF.��A�PROF.�Pct.��A�Pct.�Prep.��A�Prep.�Prof.��A�Prof.�ROM.��A�ROM.�Rom.��A�Rom.�S.A.��A�S.A.�S.A.M.D.��A�S.A.M.D.�SAMD.��A�SAMD.�SF.��A�SF.�ST.��A�ST.�STR.��A�STR.�Samd.��A�Samd.�Sf.��A�Sf.�St.��A�St.�Str.��A�Str.�TEL.��A�TEL.�Tel.��A�Tel.�UNIV.��A�UNIV.�Univ.��A�Univ.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.c.��A�a.c.�a.f.��A�a.f.�a.m.��A�a.m.�a.r.��A�a.r.�al.��A�al.�alin.��A�alin.�art.��A�art.�aug.��A�aug.�b.��A�b.�bd.��A�bd.�c.��A�c.�d-l��A�d-l�d-lui��A�d-lui�d-nei��A�d-nei�d-voastra��A�d-voastra�d-voastră��A�d-voastră�d.��A�d.�d.p.d.v.��A�d.p.d.v.�dem.��A�dem.�dpdv��A�dpdv�dr.��A�dr.�dvs.��A�dvs.�e.��A�e.�etc.��A�etc.�ex.��A�ex.�f.��A�f.�fig.��A�fig.�fr.��A�fr.�g.��A�g.�gh.��A�gh.�gr.��A�gr.�h.��A�h.�i.��A�i.�ian.��A�ian.�ing.��A�ing.�ingr.��A�ingr.�intr-adevar��A�intr-adevar�intr-adevăr��A�intr-adevăr�j.��A�j.�k.��A�k.�l.��A�l.�lit.��A�lit.�lt.��A�lt.�m.��A�m.�n.��A�n.�nr.��A�nr.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�obs.��A�obs.�p.��A�p.�p.a.��A�p.a.�p.m.��A�p.m.�pct.��A�pct.�prep.��A�prep.�prof.��A�prof.�q.��A�q.�r.��A�r.�rom.��A�rom.�s.��A�s.�s.a.��A�s.a.�s.a.m.d.��A�s.a.m.d.�samd.��A�samd.�sf.��A�sf.�st.��A�st.�str.��A�str.�t.��A�t.�tel.��A�tel.�u.��A�u.�univ.��A�univ.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�ÎNGR.��A�ÎNGR.�ÎNTR-ADEVAR��A�ÎNTR-ADEVAR�ÎNTR-ADEVĂR��A�ÎNTR-ADEVĂR�Îngr.��A�Îngr.�Într-Adevar��A�Într-Adevar�Într-Adevăr��A�Într-Adevăr�ä.��A�ä.�îngr.��A�îngr.�într-adevar��A�într-adevar�într-adevăr��A�într-adevăr�ö.��A�ö.�ü.��A�ü.�Ş.A.��A�Ş.A.�Ş.A.M.D.��A�Ş.A.M.D.�ŞAMD.��A�ŞAMD.�ŞT.��A�ŞT.�Şamd.��A�Şamd.�Şt.��A�Şt.�ş.a.��A�ş.a.�ş.a.m.d.��A�ş.a.m.d.�şamd.��A�şamd.�şt.��A�şt.�Ș.A.��A�Ș.A.�Ș.A.M.D.��A�Ș.A.M.D.�ȘAMD.��A�ȘAMD.�ȘT.��A�ȘT.�Șamd.��A�Șamd.�Șt.��A�Șt.�ș.a.��A�ș.a.�ș.a.m.d.��A�ș.a.m.d.�șamd.��A�șamd.�șt.��A�șt.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
 
1
+ ��prefix_search�
2
  ��A�
3
+ � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�1-A��A�1-A�1-UL��A�1-UL�1-Ul��A�1-Ul�1-a��A�1-a�1-ul��A�1-ul�10-A��A�10-A�10-LEA��A�10-LEA�10-Lea��A�10-Lea�10-a��A�10-a�10-lea��A�10-lea�11-A��A�11-A�11-LEA��A�11-LEA�11-Lea��A�11-Lea�11-a��A�11-a�11-lea��A�11-lea�12-A��A�12-A�12-LEA��A�12-LEA�12-Lea��A�12-Lea�12-a��A�12-a�12-lea��A�12-lea�2-A��A�2-A�2-LEA��A�2-LEA�2-Lea��A�2-Lea�2-a��A�2-a�2-lea��A�2-lea�3-A��A�3-A�3-LEA��A�3-LEA�3-Lea��A�3-Lea�3-a��A�3-a�3-lea��A�3-lea�4-A��A�4-A�4-LEA��A�4-LEA�4-Lea��A�4-Lea�4-a��A�4-a�4-lea��A�4-lea�5-A��A�5-A�5-LEA��A�5-LEA�5-Lea��A�5-Lea�5-a��A�5-a�5-lea��A�5-lea�6-A��A�6-A�6-LEA��A�6-LEA�6-Lea��A�6-Lea�6-a��A�6-a�6-lea��A�6-lea�7-A��A�7-A�7-LEA��A�7-LEA�7-Lea��A�7-Lea�7-a��A�7-a�7-lea��A�7-lea�8)��A�8)�8-)��A�8-)�8-A��A�8-A�8-D��A�8-D�8-LEA��A�8-LEA�8-Lea��A�8-Lea�8-a��A�8-a�8-lea��A�8-lea�8D��A�8D�9-A��A�9-A�9-LEA��A�9-LEA�9-Lea��A�9-Lea�9-a��A�9-a�9-lea��A�9-lea�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�A.C.��A�A.C.�A.F.��A�A.F.�A.M.��A�A.M.�A.R.��A�A.R.�AL.��A�AL.�ALIN.��A�ALIN.�ART.��A�ART.�AUG.��A�AUG.�Al.��A�Al.�Alin.��A�Alin.�Art.��A�Art.�Aug.��A�Aug.�BD.��A�BD.�Bd.��A�Bd.�C++��A�C++�D-L��A�D-L�D-LUI��A�D-LUI�D-Lui��A�D-Lui�D-NEI��A�D-NEI�D-Nei��A�D-Nei�D-VOASTRA��A�D-VOASTRA�D-VOASTRĂ��A�D-VOASTRĂ�D-Voastra��A�D-Voastra�D-Voastră��A�D-Voastră�D.P.D.V.��A�D.P.D.V.�DEM.��A�DEM.�DPDV��A�DPDV�DR.��A�DR.�DVS.��A�DVS.�Dem.��A�Dem.�Dpdv��A�Dpdv�Dr.��A�Dr.�Dvs.��A�Dvs.�ETC.��A�ETC.�EX.��A�EX.�Etc.��A�Etc.�Ex.��A�Ex.�FIG.��A�FIG.�FR.��A�FR.�Fig.��A�Fig.�Fr.��A�Fr.�GH.��A�GH.�GR.��A�GR.�Gh.��A�Gh.�Gr.��A�Gr.�IAN.��A�IAN.�ING.��A�ING.�INGR.��A�INGR.�INTR-ADEVAR��A�INTR-ADEVAR�INTR-ADEVĂR��A�INTR-ADEVĂR�Ian.��A�Ian.�Ing.��A�Ing.�Ingr.��A�Ingr.�Intr-Adevar��A�Intr-Adevar�Intr-Adevăr��A�Intr-Adevăr�LIT.��A�LIT.�LT.��A�LT.�Lit.��A�Lit.�Lt.��A�Lt.�NR.��A�NR.�Nr.��A�Nr.�O.O��A�O.O�O.o��A�O.o�OBS.��A�OBS.�O_O��A�O_O�O_o��A�O_o�Obs.��A�Obs.�P.A.��A�P.A.�P.M.��A�P.M.�PCT.��A�PCT.�PREP.��A�PREP.�PROF.��A�PROF.�Pct.��A�Pct.�Prep.��A�Prep.�Prof.��A�Prof.�ROM.��A�ROM.�Rom.��A�Rom.�S.A.��A�S.A.�S.A.M.D.��A�S.A.M.D.�SAMD.��A�SAMD.�SF.��A�SF.�ST.��A�ST.�STR.��A�STR.�Samd.��A�Samd.�Sf.��A�Sf.�St.��A�St.�Str.��A�Str.�TEL.��A�TEL.�Tel.��A�Tel.�UNIV.��A�UNIV.�Univ.��A�Univ.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.c.��A�a.c.�a.f.��A�a.f.�a.m.��A�a.m.�a.r.��A�a.r.�al.��A�al.�alin.��A�alin.�art.��A�art.�aug.��A�aug.�b.��A�b.�bd.��A�bd.�c.��A�c.�d-l��A�d-l�d-lui��A�d-lui�d-nei��A�d-nei�d-voastra��A�d-voastra�d-voastră��A�d-voastră�d.��A�d.�d.p.d.v.��A�d.p.d.v.�dem.��A�dem.�dpdv��A�dpdv�dr.��A�dr.�dvs.��A�dvs.�e.��A�e.�etc.��A�etc.�ex.��A�ex.�f.��A�f.�fig.��A�fig.�fr.��A�fr.�g.��A�g.�gh.��A�gh.�gr.��A�gr.�h.��A�h.�i.��A�i.�ian.��A�ian.�ing.��A�ing.�ingr.��A�ingr.�intr-adevar��A�intr-adevar�intr-adevăr��A�intr-adevăr�j.��A�j.�k.��A�k.�l.��A�l.�lit.��A�lit.�lt.��A�lt.�m.��A�m.�n.��A�n.�nr.��A�nr.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�obs.��A�obs.�p.��A�p.�p.a.��A�p.a.�p.m.��A�p.m.�pct.��A�pct.�prep.��A�prep.�prof.��A�prof.�q.��A�q.�r.��A�r.�rom.��A�rom.�s.��A�s.�s.a.��A�s.a.�s.a.m.d.��A�s.a.m.d.�samd.��A�samd.�sf.��A�sf.�st.��A�st.�str.��A�str.�t.��A�t.�tel.��A�tel.�u.��A�u.�univ.��A�univ.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�°C.��A�°�A�C�A�.�°F.��A�°�A�F�A�.�°K.��A�°�A�K�A�.�°c.��A�°�A�c�A�.�°f.��A�°�A�f�A�.�°k.��A�°�A�k�A�.�ÎNGR.��A�ÎNGR.�ÎNTR-ADEVAR��A�ÎNTR-ADEVAR�ÎNTR-ADEVĂR��A�ÎNTR-ADEVĂR�Îngr.��A�Îngr.�Într-Adevar��A�Într-Adevar�Într-Adevăr��A�Într-Adevăr�ä.��A�ä.�îngr.��A�îngr.�într-adevar��A�într-adevar�într-adevăr��A�într-adevăr�ö.��A�ö.�ü.��A�ü.�Ş.A.��A�Ş.A.�Ş.A.M.D.��A�Ş.A.M.D.�ŞAMD.��A�ŞAMD.�ŞT.��A�ŞT.�Şamd.��A�Şamd.�Şt.��A�Şt.�ş.a.��A�ş.a.�ş.a.m.d.��A�ş.a.m.d.�şamd.��A�şamd.�şt.��A�şt.�Ș.A.��A�Ș.A.�Ș.A.M.D.��A�Ș.A.M.D.�ȘAMD.��A�ȘAMD.�ȘT.��A�ȘT.�Șamd.��A�Șamd.�Șt.��A�Șt.�ș.a.��A�ș.a.�ș.a.m.d.��A�ș.a.m.d.�șamd.��A�șamd.�șt.��A�șt.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:559be5fcdd8d099c39c785a46b22493570df3d7f7ced2f87e9b25f59292b77bd
3
- size 1089368
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eb4b6f8ed432c0240717aaf5b7b61d5e5825362699b3ae4a543fbbd012a9e0e8
3
+ size 1099352
vocab/vectors.cfg ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+ "mode":"default"
3
+ }