osanseviero
commited on
Commit
•
07bde64
1
Parent(s):
da62d3d
Update spaCy pipeline
Browse files- LICENSES_SOURCES +1 -1
- README.md +34 -28
- accuracy.json +308 -292
- attribute_ruler/patterns +0 -0
- config.cfg +28 -25
- meta.json +326 -303
- ner/model +0 -0
- parser/model +0 -0
- parser/moves +1 -1
- ro_core_news_sm-any-py3-none-any.whl +2 -2
- senter/cfg +1 -1
- senter/model +0 -0
- tagger/cfg +14 -6
- tagger/model +0 -0
- tok2vec/model +0 -0
- tokenizer +2 -2
- vocab/strings.json +2 -2
- vocab/vectors.cfg +3 -0
LICENSES_SOURCES
CHANGED
@@ -549,7 +549,7 @@ terms of this License.```
|
|
549 |
|
550 |
|
551 |
|
552 |
-
# UD Romanian RRT v2.
|
553 |
|
554 |
* Author: Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin
|
555 |
* URL: https://github.com/UniversalDependencies/UD_Romanian-RRT
|
|
|
549 |
|
550 |
|
551 |
|
552 |
+
# UD Romanian RRT v2.8
|
553 |
|
554 |
* Author: Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin
|
555 |
* URL: https://github.com/UniversalDependencies/UD_Romanian-RRT
|
README.md
CHANGED
@@ -4,7 +4,7 @@ tags:
|
|
4 |
- token-classification
|
5 |
language:
|
6 |
- ro
|
7 |
-
license:
|
8 |
model-index:
|
9 |
- name: ro_core_news_sm
|
10 |
results:
|
@@ -14,47 +14,47 @@ model-index:
|
|
14 |
metrics:
|
15 |
- name: NER Precision
|
16 |
type: precision
|
17 |
-
value: 0.
|
18 |
- name: NER Recall
|
19 |
type: recall
|
20 |
-
value: 0.
|
21 |
- name: NER F Score
|
22 |
type: f_score
|
23 |
-
value: 0.
|
24 |
- task:
|
25 |
name: POS
|
26 |
type: token-classification
|
27 |
metrics:
|
28 |
- name: POS Accuracy
|
29 |
type: accuracy
|
30 |
-
value: 0.
|
31 |
- task:
|
32 |
name: SENTER
|
33 |
type: token-classification
|
34 |
metrics:
|
35 |
- name: SENTER Precision
|
36 |
type: precision
|
37 |
-
value: 0.
|
38 |
- name: SENTER Recall
|
39 |
type: recall
|
40 |
-
value: 0.
|
41 |
- name: SENTER F Score
|
42 |
type: f_score
|
43 |
-
value: 0.
|
44 |
- task:
|
45 |
name: UNLABELED_DEPENDENCIES
|
46 |
type: token-classification
|
47 |
metrics:
|
48 |
- name: Unlabeled Dependencies Accuracy
|
49 |
type: accuracy
|
50 |
-
value: 0.
|
51 |
- task:
|
52 |
name: LABELED_DEPENDENCIES
|
53 |
type: token-classification
|
54 |
metrics:
|
55 |
- name: Labeled Dependencies Accuracy
|
56 |
type: accuracy
|
57 |
-
value: 0.
|
58 |
---
|
59 |
### Details: https://spacy.io/models/ro#ro_core_news_sm
|
60 |
|
@@ -63,12 +63,12 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
|
|
63 |
| Feature | Description |
|
64 |
| --- | --- |
|
65 |
| **Name** | `ro_core_news_sm` |
|
66 |
-
| **Version** | `3.
|
67 |
-
| **spaCy** | `>=3.
|
68 |
| **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
|
69 |
| **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
|
70 |
| **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
|
71 |
-
| **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.
|
72 |
| **License** | `CC BY-SA 4.0` |
|
73 |
| **Author** | [Explosion](https://explosion.ai) |
|
74 |
|
@@ -76,12 +76,12 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
|
|
76 |
|
77 |
<details>
|
78 |
|
79 |
-
<summary>View label scheme (
|
80 |
|
81 |
| Component | Labels |
|
82 |
| --- | --- |
|
83 |
-
| **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc`, `Mc-p-d`, `Mc-
|
84 |
-
| **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:
|
85 |
| **`senter`** | `I`, `S` |
|
86 |
| **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
|
87 |
|
@@ -92,15 +92,21 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
|
|
92 |
| Type | Score |
|
93 |
| --- | --- |
|
94 |
| `TOKEN_ACC` | 99.90 |
|
95 |
-
| `
|
96 |
-
| `
|
97 |
-
| `
|
98 |
-
| `
|
99 |
-
| `
|
100 |
-
| `
|
101 |
-
| `
|
102 |
-
| `
|
103 |
-
| `
|
104 |
-
| `
|
105 |
-
| `
|
106 |
-
| `
|
|
|
|
|
|
|
|
|
|
|
|
|
|
4 |
- token-classification
|
5 |
language:
|
6 |
- ro
|
7 |
+
license: cc-by-sa-4.0
|
8 |
model-index:
|
9 |
- name: ro_core_news_sm
|
10 |
results:
|
|
|
14 |
metrics:
|
15 |
- name: NER Precision
|
16 |
type: precision
|
17 |
+
value: 0.7193937038
|
18 |
- name: NER Recall
|
19 |
type: recall
|
20 |
+
value: 0.711102574
|
21 |
- name: NER F Score
|
22 |
type: f_score
|
23 |
+
value: 0.7152241113
|
24 |
- task:
|
25 |
name: POS
|
26 |
type: token-classification
|
27 |
metrics:
|
28 |
- name: POS Accuracy
|
29 |
type: accuracy
|
30 |
+
value: 0.9540563521
|
31 |
- task:
|
32 |
name: SENTER
|
33 |
type: token-classification
|
34 |
metrics:
|
35 |
- name: SENTER Precision
|
36 |
type: precision
|
37 |
+
value: 0.9679572764
|
38 |
- name: SENTER Recall
|
39 |
type: recall
|
40 |
+
value: 0.9640957447
|
41 |
- name: SENTER F Score
|
42 |
type: f_score
|
43 |
+
value: 0.9660226516
|
44 |
- task:
|
45 |
name: UNLABELED_DEPENDENCIES
|
46 |
type: token-classification
|
47 |
metrics:
|
48 |
- name: Unlabeled Dependencies Accuracy
|
49 |
type: accuracy
|
50 |
+
value: 0.8691229961
|
51 |
- task:
|
52 |
name: LABELED_DEPENDENCIES
|
53 |
type: token-classification
|
54 |
metrics:
|
55 |
- name: Labeled Dependencies Accuracy
|
56 |
type: accuracy
|
57 |
+
value: 0.8691229961
|
58 |
---
|
59 |
### Details: https://spacy.io/models/ro#ro_core_news_sm
|
60 |
|
|
|
63 |
| Feature | Description |
|
64 |
| --- | --- |
|
65 |
| **Name** | `ro_core_news_sm` |
|
66 |
+
| **Version** | `3.2.0` |
|
67 |
+
| **spaCy** | `>=3.2.0,<3.3.0` |
|
68 |
| **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
|
69 |
| **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
|
70 |
| **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
|
71 |
+
| **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.8](https://github.com/UniversalDependencies/UD_Romanian-RRT) (Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin)<br />[RONEC - the Romanian Named Entity Corpus (ca9ce460)](https://github.com/dumitrescustefan/ronec) (Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan) |
|
72 |
| **License** | `CC BY-SA 4.0` |
|
73 |
| **Author** | [Explosion](https://explosion.ai) |
|
74 |
|
|
|
76 |
|
77 |
<details>
|
78 |
|
79 |
+
<summary>View label scheme (541 labels for 4 components)</summary>
|
80 |
|
81 |
| Component | Labels |
|
82 |
| --- | --- |
|
83 |
+
| **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afp-srn`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3---sy`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc-p-d`, `Mc-p-l`, `Mc-s-b`, `Mc-s-d`, `Mc-s-l`, `Mcfp-l`, `Mcfp-ln`, `Mcfprln`, `Mcfprly`, `Mcfsoln`, `Mcfsrl`, `Mcfsrln`, `Mcfsrly`, `Mcmp-l`, `Mcms-ln`, `Mcmsrl`, `Mcmsrln`, `Mcmsrly`, `Mffprln`, `Mffsrln`, `Mlfpo`, `Mlfpr`, `Mlmpr`, `Mo---l`, `Mo---ln`, `Mo-s-r`, `Mofp-ln`, `Mofpoly`, `Mofprly`, `Mofs-l`, `Mofsoln`, `Mofsoly`, `Mofsrln`, `Mofsrly`, `Mompoly`, `Momprly`, `Moms-l`, `Moms-ln`, `Momsoly`, `Momsrly`, `Nc`, `Nc---n`, `Ncf--n`, `Ncfp-n`, `Ncfpoy`, `Ncfpry`, `Ncfs-n`, `Ncfson`, `Ncfsoy`, `Ncfsrn`, `Ncfsry`, `Ncfsryy`, `Ncfsvy`, `Ncm--n`, `Ncmp-n`, `Ncmpoy`, `Ncmpry`, `Ncms-n`, `Ncms-ny`, `Ncms-y`, `Ncmsoy`, `Ncmsrn`, `Ncmsry`, `Ncmsryy`, `Ncmsvn`, `Ncmsvy`, `Np`, `Npfson`, `Npfsoy`, `Npfsrn`, `Npfsry`, `Npmpoy`, `Npmpry`, `Npms-n`, `Npmsoy`, `Npmsry`, `PERCENT`, `PERIOD`, `PLUS`, `PLUSMINUS`, `Pd3-po`, `Pd3fpr`, `Pd3fso`, `Pd3fsr`, `Pd3mpo`, `Pd3mpr`, `Pd3mpr--y`, `Pd3mso`, `Pd3msr`, `Pi3--r`, `Pi3-po`, `Pi3-so`, `Pi3-sr`, `Pi3fpr`, `Pi3fso`, `Pi3fsr`, `Pi3mpr`, `Pi3mso`, `Pi3msr`, `Pi3msr--y`, `Pp1-pa--------w`, `Pp1-pa--y-----w`, `Pp1-pd--------s`, `Pp1-pd--------w`, `Pp1-pd--y-----w`, `Pp1-pr--------s`, `Pp1-sa--------s`, `Pp1-sa--------w`, `Pp1-sa--y-----w`, `Pp1-sd--------s`, `Pp1-sd--------w`, `Pp1-sd--y-----w`, `Pp1-sn--------s`, `Pp2-----------s`, `Pp2-pa--------w`, `Pp2-pa--y-----w`, `Pp2-pd--------w`, `Pp2-pd--y-----w`, `Pp2-pr--------s`, `Pp2-sa--------s`, `Pp2-sa--------w`, `Pp2-sa--y-----w`, `Pp2-sd--------s`, `Pp2-sd--------w`, `Pp2-sd--y-----w`, `Pp2-sn--------s`, `Pp2-so--------s`, `Pp2-sr--------s`, `Pp3-p---------s`, `Pp3-pd--------w`, `Pp3-pd--y-----w`, `Pp3-po--------s`, `Pp3-sd--------w`, `Pp3-sd--y-----w`, `Pp3-so--------s`, `Pp3fpa--------w`, `Pp3fpa--y-----w`, `Pp3fpr--------s`, `Pp3fs---------s`, `Pp3fsa--------w`, `Pp3fsa--y-----w`, `Pp3fso--------s`, `Pp3fsr--------s`, `Pp3fsr--y-----s`, `Pp3mpa--------w`, `Pp3mpa--y-----w`, `Pp3mpr--------s`, `Pp3ms---------s`, `Pp3msa--------w`, `Pp3msa--y-----w`, `Pp3mso--------s`, `Pp3msr--------s`, `Pp3msr--y-----s`, `Ps1fp-s`, `Ps1fsrp`, `Ps1fsrs`, `Ps1mp-p`, `Ps1ms-p`, `Ps2fp-s`, `Ps2fsrp`, `Ps2fsrs`, `Ps3---p`, `Ps3---s`, `Ps3fp-s`, `Ps3fsrs`, `Ps3mp-s`, `Ps3ms-s`, `Pw3--r`, `Pw3-po`, `Pw3-so`, `Pw3fpr`, `Pw3fso`, `Pw3mpr`, `Pw3mso`, `Px3--a--------s`, `Px3--a--------w`, `Px3--a--y-----w`, `Px3--d--------w`, `Px3--d--y-----w`, `Pz3-sr`, `Pz3fsr`, `QUEST`, `QUOT`, `Qf`, `Qn`, `Qs`, `Qs-y`, `Qz`, `Qz-y`, `RCURL`, `RPAR`, `RSQR`, `Rc`, `Rgp`, `Rgpy`, `Rgs`, `Rp`, `Rw`, `Rw-y`, `Rz`, `SCOLON`, `SLASH`, `STAR`, `Sp`, `Spsa`, `Spsay`, `Spsd`, `Spsg`, `Td-po`, `Tdfpr`, `Tdfso`, `Tdfsr`, `Tdmpr`, `Tdmso`, `Tdmsr`, `Tf-so`, `Tffpoy`, `Tffpry`, `Tffs-y`, `Tfmpoy`, `Tfms-y`, `Tfmsoy`, `Tfmsry`, `Ti-po`, `Tifp-y`, `Tifso`, `Tifsr`, `Timso`, `Timsr`, `Tsfp`, `Tsfs`, `Tsmp`, `Tsms`, `UNDERSC`, `Va--1`, `Va--1-----y`, `Va--1p`, `Va--1s`, `Va--1s----y`, `Va--2p`, `Va--2p----y`, `Va--2s`, `Va--2s----y`, `Va--3`, `Va--3-----y`, `Va--3p`, `Va--3p----y`, `Va--3s`, `Va--3s----y`, `Vag`, `Vag-------y`, `Vaii1`, `Vaii2s`, `Vaii3p`, `Vaii3s`, `Vail3p`, `Vail3s`, `Vaip1p`, `Vaip1s`, `Vaip2p`, `Vaip2s`, `Vaip3p`, `Vaip3p----y`, `Vaip3s`, `Vaip3s----y`, `Vais3p`, `Vais3s`, `Vam-2s`, `Vanp`, `Vap--sm`, `Vasp1p`, `Vasp1s`, `Vasp2p`, `Vasp2s`, `Vasp3`, `Vmg`, `Vmg-------y`, `Vmii1`, `Vmii1-----y`, `Vmii2p`, `Vmii2s`, `Vmii3p`, `Vmii3p----y`, `Vmii3s`, `Vmii3s----y`, `Vmil1`, `Vmil1p`, `Vmil2s`, `Vmil3p`, `Vmil3p----y`, `Vmil3s`, `Vmil3s----y`, `Vmip1p`, `Vmip1p----y`, `Vmip1s`, `Vmip1s----y`, `Vmip2p`, `Vmip2s`, `Vmip2s----y`, `Vmip3`, `Vmip3-----y`, `Vmip3p`, `Vmip3s`, `Vmip3s----y`, `Vmis1p`, `Vmis1s`, `Vmis3p`, `Vmis3p----y`, `Vmis3s`, `Vmis3s----y`, `Vmm-2p`, `Vmm-2s`, `Vmnp`, `Vmnp------y`, `Vmp--pf`, `Vmp--pm`, `Vmp--sf`, `Vmp--sm`, `Vmp--sm---y`, `Vmsp1p`, `Vmsp2p`, `Vmsp2s`, `Vmsp3`, `Vmsp3-----y`, `X`, `Y`, `Ya`, `Yn`, `Ynfsoy`, `Ynfsry`, `Ynmsoy`, `Ynmsry`, `Yp`, `Yp,Yn`, `Yp-sr`, `Yr` |
|
84 |
+
| **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:tmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `obl:agent`, `obl:pmod`, `orphan`, `parataxis`, `punct`, `vocative`, `xcomp` |
|
85 |
| **`senter`** | `I`, `S` |
|
86 |
| **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
|
87 |
|
|
|
92 |
| Type | Score |
|
93 |
| --- | --- |
|
94 |
| `TOKEN_ACC` | 99.90 |
|
95 |
+
| `TOKEN_P` | 99.67 |
|
96 |
+
| `TOKEN_R` | 99.57 |
|
97 |
+
| `TOKEN_F` | 99.59 |
|
98 |
+
| `TAG_ACC` | 95.41 |
|
99 |
+
| `SENTS_P` | 96.80 |
|
100 |
+
| `SENTS_R` | 96.41 |
|
101 |
+
| `SENTS_F` | 96.60 |
|
102 |
+
| `DEP_UAS` | 86.91 |
|
103 |
+
| `DEP_LAS` | 81.14 |
|
104 |
+
| `POS_ACC` | 92.98 |
|
105 |
+
| `MORPH_ACC` | 93.87 |
|
106 |
+
| `MORPH_MICRO_P` | 98.33 |
|
107 |
+
| `MORPH_MICRO_R` | 95.07 |
|
108 |
+
| `MORPH_MICRO_F` | 96.37 |
|
109 |
+
| `LEMMA_ACC` | 81.83 |
|
110 |
+
| `ENTS_P` | 71.94 |
|
111 |
+
| `ENTS_R` | 71.11 |
|
112 |
+
| `ENTS_F` | 71.52 |
|
accuracy.json
CHANGED
@@ -1,297 +1,194 @@
|
|
1 |
{
|
2 |
"token_acc": 0.9990029326,
|
3 |
-
"
|
4 |
-
"
|
5 |
-
"
|
6 |
-
"
|
7 |
-
"
|
8 |
-
"
|
9 |
-
"
|
10 |
-
"
|
11 |
-
"
|
12 |
-
"speed": 9573.3292142848,
|
13 |
-
"morph_per_feat": {
|
14 |
-
"Case": {
|
15 |
-
"p": 0.985836417,
|
16 |
-
"r": 0.9825766247,
|
17 |
-
"f": 0.9842038217
|
18 |
-
},
|
19 |
-
"Gender": {
|
20 |
-
"p": 0.9825625474,
|
21 |
-
"r": 0.9721215152,
|
22 |
-
"f": 0.9773141457
|
23 |
-
},
|
24 |
-
"Number": {
|
25 |
-
"p": 0.9856352415,
|
26 |
-
"r": 0.9757716212,
|
27 |
-
"f": 0.98067863
|
28 |
-
},
|
29 |
-
"Person": {
|
30 |
-
"p": 0.9789227166,
|
31 |
-
"r": 0.9852681202,
|
32 |
-
"f": 0.9820851689
|
33 |
-
},
|
34 |
-
"PronType": {
|
35 |
-
"p": 0.9951523546,
|
36 |
-
"r": 0.99308915,
|
37 |
-
"f": 0.9941196818
|
38 |
-
},
|
39 |
-
"Polarity": {
|
40 |
-
"p": 0.9918166939,
|
41 |
-
"r": 0.993442623,
|
42 |
-
"f": 0.9926289926
|
43 |
-
},
|
44 |
-
"AdpType": {
|
45 |
-
"p": 0.9976311337,
|
46 |
-
"r": 0.9976311337,
|
47 |
-
"f": 0.9976311337
|
48 |
-
},
|
49 |
-
"Definite": {
|
50 |
-
"p": 0.9854908915,
|
51 |
-
"r": 0.9703174603,
|
52 |
-
"f": 0.9778453171
|
53 |
-
},
|
54 |
-
"Degree": {
|
55 |
-
"p": 0.9436038514,
|
56 |
-
"r": 0.9214237743,
|
57 |
-
"f": 0.9323819232
|
58 |
-
},
|
59 |
-
"VerbForm": {
|
60 |
-
"p": 0.9632545932,
|
61 |
-
"r": 0.9760638298,
|
62 |
-
"f": 0.9696169089
|
63 |
-
},
|
64 |
-
"Abbr": {
|
65 |
-
"p": 0.9804878049,
|
66 |
-
"r": 0.8973214286,
|
67 |
-
"f": 0.9370629371
|
68 |
-
},
|
69 |
-
"Poss": {
|
70 |
-
"p": 1.0,
|
71 |
-
"r": 0.9951807229,
|
72 |
-
"f": 0.9975845411
|
73 |
-
},
|
74 |
-
"NumForm": {
|
75 |
-
"p": 0.9944134078,
|
76 |
-
"r": 0.9807162534,
|
77 |
-
"f": 0.987517337
|
78 |
-
},
|
79 |
-
"NumType": {
|
80 |
-
"p": 0.995821727,
|
81 |
-
"r": 0.9821428571,
|
82 |
-
"f": 0.9889349931
|
83 |
-
},
|
84 |
-
"Reflex": {
|
85 |
-
"p": 1.0,
|
86 |
-
"r": 0.9935897436,
|
87 |
-
"f": 0.9967845659
|
88 |
-
},
|
89 |
-
"Strength": {
|
90 |
-
"p": 0.9841269841,
|
91 |
-
"r": 0.9841269841,
|
92 |
-
"f": 0.9841269841
|
93 |
-
},
|
94 |
-
"Mood": {
|
95 |
-
"p": 0.9446428571,
|
96 |
-
"r": 0.9706422018,
|
97 |
-
"f": 0.9574660633
|
98 |
-
},
|
99 |
-
"Tense": {
|
100 |
-
"p": 0.9470672389,
|
101 |
-
"r": 0.9622093023,
|
102 |
-
"f": 0.9545782264
|
103 |
-
},
|
104 |
-
"Variant": {
|
105 |
-
"p": 0.9867549669,
|
106 |
-
"r": 0.9612903226,
|
107 |
-
"f": 0.9738562092
|
108 |
-
},
|
109 |
-
"Position": {
|
110 |
-
"p": 1.0,
|
111 |
-
"r": 0.9910714286,
|
112 |
-
"f": 0.9955156951
|
113 |
-
},
|
114 |
-
"Number[psor]": {
|
115 |
-
"p": 1.0,
|
116 |
-
"r": 0.9666666667,
|
117 |
-
"f": 0.9830508475
|
118 |
-
},
|
119 |
-
"PartType": {
|
120 |
-
"p": 1.0,
|
121 |
-
"r": 0.9459459459,
|
122 |
-
"f": 0.9722222222
|
123 |
-
},
|
124 |
-
"Foreign": {
|
125 |
-
"p": 0.0,
|
126 |
-
"r": 0.0,
|
127 |
-
"f": 0.0
|
128 |
-
}
|
129 |
-
},
|
130 |
"dep_las_per_type": {
|
131 |
"root": {
|
132 |
-
"p": 0.
|
133 |
-
"r": 0.
|
134 |
-
"f": 0.
|
135 |
},
|
136 |
"mark": {
|
137 |
-
"p": 0.
|
138 |
-
"r": 0.
|
139 |
-
"f": 0.
|
140 |
},
|
141 |
"case": {
|
142 |
-
"p": 0.
|
143 |
-
"r": 0.
|
144 |
-
"f": 0.
|
145 |
},
|
146 |
"nmod:tmod": {
|
147 |
-
"p": 0.
|
148 |
-
"r": 0.
|
149 |
-
"f": 0.
|
150 |
},
|
151 |
"amod": {
|
152 |
-
"p": 0.
|
153 |
-
"r": 0.
|
154 |
-
"f": 0.
|
155 |
},
|
156 |
"nsubj": {
|
157 |
-
"p": 0.
|
158 |
"r": 0.7914691943,
|
159 |
-
"f": 0.
|
160 |
},
|
161 |
"nmod": {
|
162 |
-
"p": 0.
|
163 |
-
"r": 0.
|
164 |
-
"f": 0.
|
165 |
},
|
166 |
"aux": {
|
167 |
-
"p": 0.
|
168 |
-
"r": 0.
|
169 |
-
"f": 0.
|
170 |
},
|
171 |
"advcl": {
|
172 |
-
"p": 0.
|
173 |
-
"r": 0.
|
174 |
-
"f": 0.
|
175 |
},
|
176 |
"obj": {
|
177 |
-
"p": 0.
|
178 |
-
"r": 0.
|
179 |
-
"f": 0.
|
180 |
},
|
181 |
"det": {
|
182 |
-
"p": 0.
|
183 |
-
"r": 0.
|
184 |
-
"f": 0.
|
185 |
},
|
186 |
"cc": {
|
187 |
-
"p": 0.
|
188 |
-
"r": 0.
|
189 |
-
"f": 0.
|
190 |
},
|
191 |
"conj": {
|
192 |
-
"p": 0.
|
193 |
-
"r": 0.
|
194 |
-
"f": 0.
|
195 |
},
|
196 |
"nummod": {
|
197 |
-
"p": 0.
|
198 |
-
"r": 0.
|
199 |
-
"f": 0.
|
200 |
},
|
201 |
"acl": {
|
202 |
-
"p": 0.
|
203 |
-
"r": 0.
|
204 |
-
"f": 0.
|
205 |
},
|
206 |
"advmod": {
|
207 |
-
"p": 0.
|
208 |
-
"r": 0.
|
209 |
-
"f": 0.
|
210 |
},
|
211 |
"obl": {
|
212 |
-
"p": 0.
|
213 |
-
"r": 0.
|
214 |
-
"f": 0.
|
215 |
},
|
216 |
"expl:pass": {
|
217 |
-
"p": 0.
|
218 |
-
"r": 0.
|
219 |
-
"f": 0.
|
220 |
},
|
221 |
"nsubj:pass": {
|
222 |
-
"p": 0.
|
223 |
-
"r": 0.
|
224 |
-
"f": 0.
|
225 |
},
|
226 |
"fixed": {
|
227 |
-
"p": 0.
|
228 |
-
"r": 0.
|
229 |
-
"f": 0.
|
230 |
},
|
231 |
"appos": {
|
232 |
-
"p": 0.
|
233 |
"r": 0.393129771,
|
234 |
-
"f": 0.
|
235 |
},
|
236 |
"parataxis": {
|
237 |
-
"p": 0.
|
238 |
-
"r": 0.
|
239 |
-
"f": 0.
|
240 |
},
|
241 |
"aux:pass": {
|
242 |
-
"p": 0.
|
243 |
"r": 0.9133333333,
|
244 |
-
"f": 0.
|
245 |
},
|
246 |
"nmod:agent": {
|
247 |
-
"p": 0.
|
248 |
-
"r": 0.
|
249 |
-
"f": 0.
|
250 |
},
|
251 |
"ccomp": {
|
252 |
-
"p": 0.
|
253 |
-
"r": 0.
|
254 |
-
"f": 0.
|
255 |
},
|
256 |
"nmod:pmod": {
|
257 |
-
"p": 0.
|
258 |
-
"r": 0.
|
259 |
-
"f": 0.
|
260 |
},
|
261 |
"iobj": {
|
262 |
-
"p": 0.
|
263 |
-
"r": 0.
|
264 |
-
"f": 0.
|
265 |
},
|
266 |
"flat": {
|
267 |
-
"p": 0.
|
268 |
-
"r": 0.
|
269 |
-
"f": 0.
|
270 |
},
|
271 |
"cop": {
|
272 |
-
"p": 0.
|
273 |
-
"r": 0.
|
274 |
-
"f": 0.
|
275 |
},
|
276 |
"csubj": {
|
277 |
-
"p": 0
|
278 |
-
"r": 0.
|
279 |
-
"f": 0.
|
280 |
},
|
281 |
"dep": {
|
282 |
"p": 0.0,
|
283 |
"r": 0.0,
|
284 |
"f": 0.0
|
285 |
},
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
286 |
"expl:pv": {
|
287 |
-
"p": 0.
|
288 |
-
"r": 0.
|
289 |
-
"f": 0.
|
290 |
},
|
291 |
"expl": {
|
292 |
-
"p": 0.
|
293 |
-
"r": 0.
|
294 |
-
"f": 0.
|
295 |
},
|
296 |
"vocative": {
|
297 |
"p": 0.0,
|
@@ -299,139 +196,258 @@
|
|
299 |
"f": 0.0
|
300 |
},
|
301 |
"expl:poss": {
|
302 |
-
"p": 0
|
303 |
-
"r": 0.
|
304 |
-
"f": 0.
|
305 |
},
|
306 |
"goeswith": {
|
307 |
-
"p": 0.1,
|
308 |
-
"r": 0.3333333333,
|
309 |
-
"f": 0.1538461538
|
310 |
-
},
|
311 |
-
"xcomp": {
|
312 |
-
"p": 0.325,
|
313 |
-
"r": 0.4814814815,
|
314 |
-
"f": 0.3880597015
|
315 |
-
},
|
316 |
-
"compound": {
|
317 |
-
"p": 0.3333333333,
|
318 |
-
"r": 0.5714285714,
|
319 |
-
"f": 0.4210526316
|
320 |
-
},
|
321 |
-
"ccomp:pmod": {
|
322 |
"p": 0.0,
|
323 |
"r": 0.0,
|
324 |
"f": 0.0
|
325 |
},
|
|
|
|
|
|
|
|
|
|
|
326 |
"orphan": {
|
327 |
"p": 0.0,
|
328 |
"r": 0.0,
|
329 |
"f": 0.0
|
330 |
},
|
331 |
"expl:impers": {
|
332 |
-
"p": 0.
|
333 |
-
"r": 0.
|
334 |
-
"f": 0.
|
|
|
|
|
|
|
|
|
|
|
335 |
},
|
336 |
"list": {
|
337 |
"p": 0.0,
|
338 |
"r": 0.0,
|
339 |
"f": 0.0
|
340 |
},
|
|
|
|
|
|
|
|
|
|
|
341 |
"cc:preconj": {
|
342 |
"p": 0.0,
|
343 |
"r": 0.0,
|
344 |
"f": 0.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
345 |
},
|
346 |
-
"
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
347 |
"p": 0.0,
|
348 |
"r": 0.0,
|
349 |
"f": 0.0
|
350 |
}
|
351 |
},
|
352 |
-
"
|
353 |
-
"
|
354 |
-
"
|
|
|
355 |
"ents_per_type": {
|
356 |
"DATETIME": {
|
357 |
-
"p": 0.
|
358 |
-
"r": 0.
|
359 |
-
"f": 0.
|
360 |
},
|
361 |
"ORGANIZATION": {
|
362 |
-
"p": 0.
|
363 |
-
"r": 0.
|
364 |
-
"f": 0.
|
365 |
},
|
366 |
"FACILITY": {
|
367 |
-
"p": 0.
|
368 |
"r": 0.4045801527,
|
369 |
-
"f": 0.
|
370 |
},
|
371 |
"NUMERIC_VALUE": {
|
372 |
-
"p": 0.
|
373 |
-
"r": 0.
|
374 |
-
"f": 0.
|
375 |
},
|
376 |
"ORDINAL": {
|
377 |
-
"p": 0.
|
378 |
-
"r": 0.
|
379 |
-
"f": 0.
|
380 |
},
|
381 |
"EVENT": {
|
382 |
-
"p": 0.
|
383 |
-
"r": 0.
|
384 |
-
"f": 0.
|
385 |
},
|
386 |
"GPE": {
|
387 |
-
"p": 0.
|
388 |
-
"r": 0.
|
389 |
-
"f": 0.
|
390 |
},
|
391 |
"PERSON": {
|
392 |
-
"p": 0.
|
393 |
-
"r": 0.
|
394 |
-
"f": 0.
|
395 |
},
|
396 |
"NAT_REL_POL": {
|
397 |
-
"p": 0.
|
398 |
-
"r": 0.
|
399 |
-
"f": 0.
|
400 |
},
|
401 |
"MONEY": {
|
402 |
-
"p": 0.
|
403 |
-
"r": 0.
|
404 |
-
"f": 0.
|
405 |
},
|
406 |
"PRODUCT": {
|
407 |
-
"p": 0.
|
408 |
-
"r": 0.
|
409 |
-
"f": 0.
|
410 |
},
|
411 |
"LOC": {
|
412 |
-
"p": 0.
|
413 |
-
"r": 0.
|
414 |
-
"f": 0.
|
415 |
},
|
416 |
"WORK_OF_ART": {
|
417 |
-
"p": 0.
|
418 |
-
"r": 0.
|
419 |
-
"f": 0.
|
420 |
},
|
421 |
"QUANTITY": {
|
422 |
-
"p": 0.
|
423 |
"r": 0.8846153846,
|
424 |
-
"f": 0.
|
425 |
},
|
426 |
"LANGUAGE": {
|
427 |
-
"p": 0.
|
428 |
-
"r":
|
429 |
-
"f": 0.
|
430 |
},
|
431 |
"PERIOD": {
|
432 |
-
"p": 0.
|
433 |
-
"r": 0.
|
434 |
-
"f": 0.
|
435 |
}
|
436 |
-
}
|
|
|
437 |
}
|
|
|
1 |
{
|
2 |
"token_acc": 0.9990029326,
|
3 |
+
"token_p": 0.9967350492,
|
4 |
+
"token_r": 0.9957244934,
|
5 |
+
"token_f": 0.9959492157,
|
6 |
+
"tag_acc": 0.9540563521,
|
7 |
+
"sents_p": 0.9679572764,
|
8 |
+
"sents_r": 0.9640957447,
|
9 |
+
"sents_f": 0.9660226516,
|
10 |
+
"dep_uas": 0.8691229961,
|
11 |
+
"dep_las": 0.8114296468,
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
12 |
"dep_las_per_type": {
|
13 |
"root": {
|
14 |
+
"p": 0.8723021583,
|
15 |
+
"r": 0.9133709981,
|
16 |
+
"f": 0.8923643054
|
17 |
},
|
18 |
"mark": {
|
19 |
+
"p": 0.9135338346,
|
20 |
+
"r": 0.9169811321,
|
21 |
+
"f": 0.9152542373
|
22 |
},
|
23 |
"case": {
|
24 |
+
"p": 0.9534009771,
|
25 |
+
"r": 0.951256093,
|
26 |
+
"f": 0.9523273273
|
27 |
},
|
28 |
"nmod:tmod": {
|
29 |
+
"p": 0.6888888889,
|
30 |
+
"r": 0.2605042017,
|
31 |
+
"f": 0.3780487805
|
32 |
},
|
33 |
"amod": {
|
34 |
+
"p": 0.88,
|
35 |
+
"r": 0.8807495741,
|
36 |
+
"f": 0.8803746275
|
37 |
},
|
38 |
"nsubj": {
|
39 |
+
"p": 0.835,
|
40 |
"r": 0.7914691943,
|
41 |
+
"f": 0.8126520681
|
42 |
},
|
43 |
"nmod": {
|
44 |
+
"p": 0.7983435362,
|
45 |
+
"r": 0.7915030346,
|
46 |
+
"f": 0.7949085694
|
47 |
},
|
48 |
"aux": {
|
49 |
+
"p": 0.9665427509,
|
50 |
+
"r": 0.9506398537,
|
51 |
+
"f": 0.9585253456
|
52 |
},
|
53 |
"advcl": {
|
54 |
+
"p": 0.5862068966,
|
55 |
+
"r": 0.6390977444,
|
56 |
+
"f": 0.6115107914
|
57 |
},
|
58 |
"obj": {
|
59 |
+
"p": 0.800443459,
|
60 |
+
"r": 0.8337182448,
|
61 |
+
"f": 0.8167420814
|
62 |
},
|
63 |
"det": {
|
64 |
+
"p": 0.9417142857,
|
65 |
+
"r": 0.933182333,
|
66 |
+
"f": 0.9374288965
|
67 |
},
|
68 |
"cc": {
|
69 |
+
"p": 0.9206680585,
|
70 |
+
"r": 0.9206680585,
|
71 |
+
"f": 0.9206680585
|
72 |
},
|
73 |
"conj": {
|
74 |
+
"p": 0.5717948718,
|
75 |
+
"r": 0.516801854,
|
76 |
+
"f": 0.5429093122
|
77 |
},
|
78 |
"nummod": {
|
79 |
+
"p": 0.8808777429,
|
80 |
+
"r": 0.8726708075,
|
81 |
+
"f": 0.8767550702
|
82 |
},
|
83 |
"acl": {
|
84 |
+
"p": 0.7312138728,
|
85 |
+
"r": 0.65374677,
|
86 |
+
"f": 0.690313779
|
87 |
},
|
88 |
"advmod": {
|
89 |
+
"p": 0.774535809,
|
90 |
+
"r": 0.7704485488,
|
91 |
+
"f": 0.7724867725
|
92 |
},
|
93 |
"obl": {
|
94 |
+
"p": 0.6485042735,
|
95 |
+
"r": 0.7703045685,
|
96 |
+
"f": 0.7041763341
|
97 |
},
|
98 |
"expl:pass": {
|
99 |
+
"p": 0.8367346939,
|
100 |
+
"r": 0.7592592593,
|
101 |
+
"f": 0.7961165049
|
102 |
},
|
103 |
"nsubj:pass": {
|
104 |
+
"p": 0.8206896552,
|
105 |
+
"r": 0.7256097561,
|
106 |
+
"f": 0.7702265372
|
107 |
},
|
108 |
"fixed": {
|
109 |
+
"p": 0.867803838,
|
110 |
+
"r": 0.8604651163,
|
111 |
+
"f": 0.864118896
|
112 |
},
|
113 |
"appos": {
|
114 |
+
"p": 0.4136546185,
|
115 |
"r": 0.393129771,
|
116 |
+
"f": 0.4031311155
|
117 |
},
|
118 |
"parataxis": {
|
119 |
+
"p": 0.2272727273,
|
120 |
+
"r": 0.2857142857,
|
121 |
+
"f": 0.253164557
|
122 |
},
|
123 |
"aux:pass": {
|
124 |
+
"p": 0.9133333333,
|
125 |
"r": 0.9133333333,
|
126 |
+
"f": 0.9133333333
|
127 |
},
|
128 |
"nmod:agent": {
|
129 |
+
"p": 0.0,
|
130 |
+
"r": 0.0,
|
131 |
+
"f": 0.0
|
132 |
},
|
133 |
"ccomp": {
|
134 |
+
"p": 0.8548387097,
|
135 |
+
"r": 0.8217054264,
|
136 |
+
"f": 0.837944664
|
137 |
},
|
138 |
"nmod:pmod": {
|
139 |
+
"p": 0.0,
|
140 |
+
"r": 0.0,
|
141 |
+
"f": 0.0
|
142 |
},
|
143 |
"iobj": {
|
144 |
+
"p": 0.6842105263,
|
145 |
+
"r": 0.6419753086,
|
146 |
+
"f": 0.6624203822
|
147 |
},
|
148 |
"flat": {
|
149 |
+
"p": 0.7493333333,
|
150 |
+
"r": 0.7394736842,
|
151 |
+
"f": 0.7443708609
|
152 |
},
|
153 |
"cop": {
|
154 |
+
"p": 0.8034188034,
|
155 |
+
"r": 0.7580645161,
|
156 |
+
"f": 0.7800829876
|
157 |
},
|
158 |
"csubj": {
|
159 |
+
"p": 1.0,
|
160 |
+
"r": 0.8095238095,
|
161 |
+
"f": 0.8947368421
|
162 |
},
|
163 |
"dep": {
|
164 |
"p": 0.0,
|
165 |
"r": 0.0,
|
166 |
"f": 0.0
|
167 |
},
|
168 |
+
"obl:agent": {
|
169 |
+
"p": 0.0,
|
170 |
+
"r": 0.0,
|
171 |
+
"f": 0.0
|
172 |
+
},
|
173 |
+
"xcomp": {
|
174 |
+
"p": 0.3658536585,
|
175 |
+
"r": 0.5555555556,
|
176 |
+
"f": 0.4411764706
|
177 |
+
},
|
178 |
+
"obl:pmod": {
|
179 |
+
"p": 0.0,
|
180 |
+
"r": 0.0,
|
181 |
+
"f": 0.0
|
182 |
+
},
|
183 |
"expl:pv": {
|
184 |
+
"p": 0.7808219178,
|
185 |
+
"r": 0.8260869565,
|
186 |
+
"f": 0.8028169014
|
187 |
},
|
188 |
"expl": {
|
189 |
+
"p": 0.696969697,
|
190 |
+
"r": 0.8518518519,
|
191 |
+
"f": 0.7666666667
|
192 |
},
|
193 |
"vocative": {
|
194 |
"p": 0.0,
|
|
|
196 |
"f": 0.0
|
197 |
},
|
198 |
"expl:poss": {
|
199 |
+
"p": 1.0,
|
200 |
+
"r": 0.9677419355,
|
201 |
+
"f": 0.9836065574
|
202 |
},
|
203 |
"goeswith": {
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
204 |
"p": 0.0,
|
205 |
"r": 0.0,
|
206 |
"f": 0.0
|
207 |
},
|
208 |
+
"ccomp:pmod": {
|
209 |
+
"p": 0.0909090909,
|
210 |
+
"r": 0.3333333333,
|
211 |
+
"f": 0.1428571429
|
212 |
+
},
|
213 |
"orphan": {
|
214 |
"p": 0.0,
|
215 |
"r": 0.0,
|
216 |
"f": 0.0
|
217 |
},
|
218 |
"expl:impers": {
|
219 |
+
"p": 0.0,
|
220 |
+
"r": 0.0,
|
221 |
+
"f": 0.0
|
222 |
+
},
|
223 |
+
"compound": {
|
224 |
+
"p": 0.75,
|
225 |
+
"r": 0.4285714286,
|
226 |
+
"f": 0.5454545455
|
227 |
},
|
228 |
"list": {
|
229 |
"p": 0.0,
|
230 |
"r": 0.0,
|
231 |
"f": 0.0
|
232 |
},
|
233 |
+
"csubj:pass": {
|
234 |
+
"p": 0.0,
|
235 |
+
"r": 0.0,
|
236 |
+
"f": 0.0
|
237 |
+
},
|
238 |
"cc:preconj": {
|
239 |
"p": 0.0,
|
240 |
"r": 0.0,
|
241 |
"f": 0.0
|
242 |
+
}
|
243 |
+
},
|
244 |
+
"pos_acc": 0.9297553275,
|
245 |
+
"morph_acc": 0.9386929369,
|
246 |
+
"morph_micro_p": 0.9832628288,
|
247 |
+
"morph_micro_r": 0.9507350539,
|
248 |
+
"morph_micro_f": 0.9636742939,
|
249 |
+
"morph_per_feat": {
|
250 |
+
"Case": {
|
251 |
+
"p": 0.9877237852,
|
252 |
+
"r": 0.9823222689,
|
253 |
+
"f": 0.985015622
|
254 |
},
|
255 |
+
"Gender": {
|
256 |
+
"p": 0.9819398838,
|
257 |
+
"r": 0.9719964996,
|
258 |
+
"f": 0.9769428912
|
259 |
+
},
|
260 |
+
"Number": {
|
261 |
+
"p": 0.9836679143,
|
262 |
+
"r": 0.9136205625,
|
263 |
+
"f": 0.9473511742
|
264 |
+
},
|
265 |
+
"Person": {
|
266 |
+
"p": 0.982373678,
|
267 |
+
"r": 0.9852681202,
|
268 |
+
"f": 0.9838187702
|
269 |
+
},
|
270 |
+
"PronType": {
|
271 |
+
"p": 0.9937629938,
|
272 |
+
"r": 0.991015895,
|
273 |
+
"f": 0.9923875433
|
274 |
+
},
|
275 |
+
"Polarity": {
|
276 |
+
"p": 0.9918433931,
|
277 |
+
"r": 0.9967213115,
|
278 |
+
"f": 0.9942763696
|
279 |
+
},
|
280 |
+
"AdpType": {
|
281 |
+
"p": 0.999660787,
|
282 |
+
"r": 0.9972927242,
|
283 |
+
"f": 0.9984753515
|
284 |
+
},
|
285 |
+
"Definite": {
|
286 |
+
"p": 0.9843724827,
|
287 |
+
"r": 0.9698412698,
|
288 |
+
"f": 0.9770528504
|
289 |
+
},
|
290 |
+
"Degree": {
|
291 |
+
"p": 0.9400413508,
|
292 |
+
"r": 0.916051041,
|
293 |
+
"f": 0.9278911565
|
294 |
+
},
|
295 |
+
"VerbForm": {
|
296 |
+
"p": 0.9644736842,
|
297 |
+
"r": 0.9747340426,
|
298 |
+
"f": 0.9695767196
|
299 |
+
},
|
300 |
+
"Abbr": {
|
301 |
+
"p": 0.9754901961,
|
302 |
+
"r": 0.8883928571,
|
303 |
+
"f": 0.9299065421
|
304 |
+
},
|
305 |
+
"Poss": {
|
306 |
+
"p": 1.0,
|
307 |
+
"r": 0.9855421687,
|
308 |
+
"f": 0.9927184466
|
309 |
+
},
|
310 |
+
"NumForm": {
|
311 |
+
"p": 1.0,
|
312 |
+
"r": 0.3223140496,
|
313 |
+
"f": 0.4875
|
314 |
+
},
|
315 |
+
"NumType": {
|
316 |
+
"p": 1.0,
|
317 |
+
"r": 0.3241758242,
|
318 |
+
"f": 0.489626556
|
319 |
+
},
|
320 |
+
"Reflex": {
|
321 |
+
"p": 1.0,
|
322 |
+
"r": 0.9935897436,
|
323 |
+
"f": 0.9967845659
|
324 |
+
},
|
325 |
+
"Strength": {
|
326 |
+
"p": 0.9919354839,
|
327 |
+
"r": 0.9761904762,
|
328 |
+
"f": 0.984
|
329 |
+
},
|
330 |
+
"Mood": {
|
331 |
+
"p": 0.9584086799,
|
332 |
+
"r": 0.9724770642,
|
333 |
+
"f": 0.9653916211
|
334 |
+
},
|
335 |
+
"Tense": {
|
336 |
+
"p": 0.9596541787,
|
337 |
+
"r": 0.9680232558,
|
338 |
+
"f": 0.9638205499
|
339 |
+
},
|
340 |
+
"Variant": {
|
341 |
+
"p": 0.9865771812,
|
342 |
+
"r": 0.9483870968,
|
343 |
+
"f": 0.9671052632
|
344 |
+
},
|
345 |
+
"Position": {
|
346 |
+
"p": 0.9910714286,
|
347 |
+
"r": 0.9910714286,
|
348 |
+
"f": 0.9910714286
|
349 |
+
},
|
350 |
+
"Number[psor]": {
|
351 |
+
"p": 1.0,
|
352 |
+
"r": 0.9,
|
353 |
+
"f": 0.9473684211
|
354 |
+
},
|
355 |
+
"PartType": {
|
356 |
+
"p": 0.9722222222,
|
357 |
+
"r": 0.9459459459,
|
358 |
+
"f": 0.9589041096
|
359 |
+
},
|
360 |
+
"Foreign": {
|
361 |
"p": 0.0,
|
362 |
"r": 0.0,
|
363 |
"f": 0.0
|
364 |
}
|
365 |
},
|
366 |
+
"lemma_acc": 0.8183070924,
|
367 |
+
"ents_p": 0.7193937038,
|
368 |
+
"ents_r": 0.711102574,
|
369 |
+
"ents_f": 0.7152241113,
|
370 |
"ents_per_type": {
|
371 |
"DATETIME": {
|
372 |
+
"p": 0.7883959044,
|
373 |
+
"r": 0.8048780488,
|
374 |
+
"f": 0.7965517241
|
375 |
},
|
376 |
"ORGANIZATION": {
|
377 |
+
"p": 0.6325301205,
|
378 |
+
"r": 0.6687898089,
|
379 |
+
"f": 0.6501547988
|
380 |
},
|
381 |
"FACILITY": {
|
382 |
+
"p": 0.4862385321,
|
383 |
"r": 0.4045801527,
|
384 |
+
"f": 0.4416666667
|
385 |
},
|
386 |
"NUMERIC_VALUE": {
|
387 |
+
"p": 0.9227467811,
|
388 |
+
"r": 0.9110169492,
|
389 |
+
"f": 0.9168443497
|
390 |
},
|
391 |
"ORDINAL": {
|
392 |
+
"p": 0.7857142857,
|
393 |
+
"r": 0.8,
|
394 |
+
"f": 0.7927927928
|
395 |
},
|
396 |
"EVENT": {
|
397 |
+
"p": 0.5294117647,
|
398 |
+
"r": 0.4864864865,
|
399 |
+
"f": 0.5070422535
|
400 |
},
|
401 |
"GPE": {
|
402 |
+
"p": 0.7953586498,
|
403 |
+
"r": 0.8666666667,
|
404 |
+
"f": 0.8294829483
|
405 |
},
|
406 |
"PERSON": {
|
407 |
+
"p": 0.6728187919,
|
408 |
+
"r": 0.6728187919,
|
409 |
+
"f": 0.6728187919
|
410 |
},
|
411 |
"NAT_REL_POL": {
|
412 |
+
"p": 0.8854961832,
|
413 |
+
"r": 0.7733333333,
|
414 |
+
"f": 0.8256227758
|
415 |
},
|
416 |
"MONEY": {
|
417 |
+
"p": 0.9230769231,
|
418 |
+
"r": 0.8275862069,
|
419 |
+
"f": 0.8727272727
|
420 |
},
|
421 |
"PRODUCT": {
|
422 |
+
"p": 0.427184466,
|
423 |
+
"r": 0.3211678832,
|
424 |
+
"f": 0.3666666667
|
425 |
},
|
426 |
"LOC": {
|
427 |
+
"p": 0.4782608696,
|
428 |
+
"r": 0.4342105263,
|
429 |
+
"f": 0.4551724138
|
430 |
},
|
431 |
"WORK_OF_ART": {
|
432 |
+
"p": 0.2222222222,
|
433 |
+
"r": 0.2105263158,
|
434 |
+
"f": 0.2162162162
|
435 |
},
|
436 |
"QUANTITY": {
|
437 |
+
"p": 0.7419354839,
|
438 |
"r": 0.8846153846,
|
439 |
+
"f": 0.8070175439
|
440 |
},
|
441 |
"LANGUAGE": {
|
442 |
+
"p": 0.6,
|
443 |
+
"r": 0.75,
|
444 |
+
"f": 0.6666666667
|
445 |
},
|
446 |
"PERIOD": {
|
447 |
+
"p": 0.8378378378,
|
448 |
+
"r": 0.7380952381,
|
449 |
+
"f": 0.7848101266
|
450 |
}
|
451 |
+
},
|
452 |
+
"speed": 8496.149813704
|
453 |
}
|
attribute_ruler/patterns
CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
|
|
config.cfg
CHANGED
@@ -1,10 +1,8 @@
|
|
1 |
[paths]
|
2 |
-
train =
|
3 |
-
dev =
|
4 |
vectors = null
|
5 |
-
raw = null
|
6 |
init_tok2vec = null
|
7 |
-
vocab_data = null
|
8 |
|
9 |
[system]
|
10 |
gpu_allocator = null
|
@@ -24,6 +22,7 @@ tokenizer = {"@tokenizers":"spacy.Tokenizer.v1"}
|
|
24 |
|
25 |
[components.attribute_ruler]
|
26 |
factory = "attribute_ruler"
|
|
|
27 |
validate = false
|
28 |
|
29 |
[components.lemmatizer]
|
@@ -31,11 +30,13 @@ factory = "lemmatizer"
|
|
31 |
mode = "lookup"
|
32 |
model = null
|
33 |
overwrite = false
|
|
|
34 |
|
35 |
[components.ner]
|
36 |
factory = "ner"
|
37 |
incorrect_spans_key = null
|
38 |
moves = null
|
|
|
39 |
update_with_oracle_cut_size = 100
|
40 |
|
41 |
[components.ner.model]
|
@@ -53,8 +54,8 @@ nO = null
|
|
53 |
[components.ner.model.tok2vec.embed]
|
54 |
@architectures = "spacy.MultiHashEmbed.v2"
|
55 |
width = 96
|
56 |
-
attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
|
57 |
-
rows = [5000,2500,2500,2500]
|
58 |
include_static_vectors = false
|
59 |
|
60 |
[components.ner.model.tok2vec.encode]
|
@@ -69,6 +70,7 @@ factory = "parser"
|
|
69 |
learn_tokens = false
|
70 |
min_action_freq = 30
|
71 |
moves = null
|
|
|
72 |
update_with_oracle_cut_size = 100
|
73 |
|
74 |
[components.parser.model]
|
@@ -87,6 +89,8 @@ upstream = "tok2vec"
|
|
87 |
|
88 |
[components.senter]
|
89 |
factory = "senter"
|
|
|
|
|
90 |
|
91 |
[components.senter.model]
|
92 |
@architectures = "spacy.Tagger.v1"
|
@@ -98,8 +102,8 @@ nO = null
|
|
98 |
[components.senter.model.tok2vec.embed]
|
99 |
@architectures = "spacy.MultiHashEmbed.v2"
|
100 |
width = 16
|
101 |
-
attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
|
102 |
-
rows = [1000,500,500,500]
|
103 |
include_static_vectors = false
|
104 |
|
105 |
[components.senter.model.tok2vec.encode]
|
@@ -111,6 +115,8 @@ maxout_pieces = 2
|
|
111 |
|
112 |
[components.tagger]
|
113 |
factory = "tagger"
|
|
|
|
|
114 |
|
115 |
[components.tagger.model]
|
116 |
@architectures = "spacy.Tagger.v1"
|
@@ -130,8 +136,8 @@ factory = "tok2vec"
|
|
130 |
[components.tok2vec.model.embed]
|
131 |
@architectures = "spacy.MultiHashEmbed.v2"
|
132 |
width = ${components.tok2vec.model.encode:width}
|
133 |
-
attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
|
134 |
-
rows = [5000,2500,2500,2500]
|
135 |
include_static_vectors = false
|
136 |
|
137 |
[components.tok2vec.model.encode]
|
@@ -145,22 +151,19 @@ maxout_pieces = 3
|
|
145 |
|
146 |
[corpora.dev]
|
147 |
@readers = "spacy.Corpus.v1"
|
148 |
-
|
149 |
-
max_length = 0
|
150 |
-
path = ${paths:dev}
|
151 |
gold_preproc = false
|
|
|
|
|
152 |
augmenter = null
|
153 |
|
154 |
[corpora.train]
|
155 |
@readers = "spacy.Corpus.v1"
|
156 |
-
path = ${paths
|
157 |
-
max_length = 5000
|
158 |
gold_preproc = false
|
|
|
159 |
limit = 0
|
160 |
-
|
161 |
-
[corpora.train.augmenter]
|
162 |
-
@augmenters = "spacy.lower_case.v1"
|
163 |
-
level = 0.1
|
164 |
|
165 |
[training]
|
166 |
train_corpus = "corpora.train"
|
@@ -191,9 +194,8 @@ compound = 1.001
|
|
191 |
t = 0.0
|
192 |
|
193 |
[training.logger]
|
194 |
-
@loggers = "spacy.
|
195 |
-
|
196 |
-
remove_config_values = []
|
197 |
|
198 |
[training.optimizer]
|
199 |
@optimizers = "Adam.v1"
|
@@ -214,16 +216,17 @@ dep_las_per_type = null
|
|
214 |
sents_p = null
|
215 |
sents_r = null
|
216 |
sents_f = 0.02
|
217 |
-
lemma_acc = 0.
|
218 |
-
ents_f = 0.
|
219 |
ents_p = 0.0
|
220 |
ents_r = 0.0
|
221 |
ents_per_type = null
|
|
|
222 |
|
223 |
[pretraining]
|
224 |
|
225 |
[initialize]
|
226 |
-
vocab_data =
|
227 |
vectors = ${paths.vectors}
|
228 |
init_tok2vec = ${paths.init_tok2vec}
|
229 |
before_init = null
|
|
|
1 |
[paths]
|
2 |
+
train = null
|
3 |
+
dev = null
|
4 |
vectors = null
|
|
|
5 |
init_tok2vec = null
|
|
|
6 |
|
7 |
[system]
|
8 |
gpu_allocator = null
|
|
|
22 |
|
23 |
[components.attribute_ruler]
|
24 |
factory = "attribute_ruler"
|
25 |
+
scorer = {"@scorers":"spacy.attribute_ruler_scorer.v1"}
|
26 |
validate = false
|
27 |
|
28 |
[components.lemmatizer]
|
|
|
30 |
mode = "lookup"
|
31 |
model = null
|
32 |
overwrite = false
|
33 |
+
scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
|
34 |
|
35 |
[components.ner]
|
36 |
factory = "ner"
|
37 |
incorrect_spans_key = null
|
38 |
moves = null
|
39 |
+
scorer = {"@scorers":"spacy.ner_scorer.v1"}
|
40 |
update_with_oracle_cut_size = 100
|
41 |
|
42 |
[components.ner.model]
|
|
|
54 |
[components.ner.model.tok2vec.embed]
|
55 |
@architectures = "spacy.MultiHashEmbed.v2"
|
56 |
width = 96
|
57 |
+
attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
|
58 |
+
rows = [5000,2500,2500,2500,100]
|
59 |
include_static_vectors = false
|
60 |
|
61 |
[components.ner.model.tok2vec.encode]
|
|
|
70 |
learn_tokens = false
|
71 |
min_action_freq = 30
|
72 |
moves = null
|
73 |
+
scorer = {"@scorers":"spacy.parser_scorer.v1"}
|
74 |
update_with_oracle_cut_size = 100
|
75 |
|
76 |
[components.parser.model]
|
|
|
89 |
|
90 |
[components.senter]
|
91 |
factory = "senter"
|
92 |
+
overwrite = false
|
93 |
+
scorer = {"@scorers":"spacy.senter_scorer.v1"}
|
94 |
|
95 |
[components.senter.model]
|
96 |
@architectures = "spacy.Tagger.v1"
|
|
|
102 |
[components.senter.model.tok2vec.embed]
|
103 |
@architectures = "spacy.MultiHashEmbed.v2"
|
104 |
width = 16
|
105 |
+
attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
|
106 |
+
rows = [1000,500,500,500,50]
|
107 |
include_static_vectors = false
|
108 |
|
109 |
[components.senter.model.tok2vec.encode]
|
|
|
115 |
|
116 |
[components.tagger]
|
117 |
factory = "tagger"
|
118 |
+
overwrite = false
|
119 |
+
scorer = {"@scorers":"spacy.tagger_scorer.v1"}
|
120 |
|
121 |
[components.tagger.model]
|
122 |
@architectures = "spacy.Tagger.v1"
|
|
|
136 |
[components.tok2vec.model.embed]
|
137 |
@architectures = "spacy.MultiHashEmbed.v2"
|
138 |
width = ${components.tok2vec.model.encode:width}
|
139 |
+
attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
|
140 |
+
rows = [5000,2500,2500,2500,100]
|
141 |
include_static_vectors = false
|
142 |
|
143 |
[components.tok2vec.model.encode]
|
|
|
151 |
|
152 |
[corpora.dev]
|
153 |
@readers = "spacy.Corpus.v1"
|
154 |
+
path = ${paths.dev}
|
|
|
|
|
155 |
gold_preproc = false
|
156 |
+
max_length = 0
|
157 |
+
limit = 0
|
158 |
augmenter = null
|
159 |
|
160 |
[corpora.train]
|
161 |
@readers = "spacy.Corpus.v1"
|
162 |
+
path = ${paths.train}
|
|
|
163 |
gold_preproc = false
|
164 |
+
max_length = 0
|
165 |
limit = 0
|
166 |
+
augmenter = null
|
|
|
|
|
|
|
167 |
|
168 |
[training]
|
169 |
train_corpus = "corpora.train"
|
|
|
194 |
t = 0.0
|
195 |
|
196 |
[training.logger]
|
197 |
+
@loggers = "spacy.ConsoleLogger.v1"
|
198 |
+
progress_bar = false
|
|
|
199 |
|
200 |
[training.optimizer]
|
201 |
@optimizers = "Adam.v1"
|
|
|
216 |
sents_p = null
|
217 |
sents_r = null
|
218 |
sents_f = 0.02
|
219 |
+
lemma_acc = 0.5
|
220 |
+
ents_f = 0.16
|
221 |
ents_p = 0.0
|
222 |
ents_r = 0.0
|
223 |
ents_per_type = null
|
224 |
+
speed = 0.0
|
225 |
|
226 |
[pretraining]
|
227 |
|
228 |
[initialize]
|
229 |
+
vocab_data = null
|
230 |
vectors = ${paths.vectors}
|
231 |
init_tok2vec = ${paths.init_tok2vec}
|
232 |
before_init = null
|
meta.json
CHANGED
@@ -1,14 +1,14 @@
|
|
1 |
{
|
2 |
"lang":"ro",
|
3 |
"name":"core_news_sm",
|
4 |
-
"version":"3.
|
5 |
"description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
|
6 |
"author":"Explosion",
|
7 |
"email":"[email protected]",
|
8 |
"url":"https://explosion.ai",
|
9 |
"license":"CC BY-SA 4.0",
|
10 |
-
"spacy_version":">=3.
|
11 |
-
"spacy_git_version":"
|
12 |
"vectors":{
|
13 |
"width":0,
|
14 |
"vectors":0,
|
@@ -30,6 +30,7 @@
|
|
30 |
"Afp",
|
31 |
"Afp-p-n",
|
32 |
"Afp-poy",
|
|
|
33 |
"Afpf--n",
|
34 |
"Afpfp-n",
|
35 |
"Afpfp-ny",
|
@@ -131,6 +132,7 @@
|
|
131 |
"Ds2ms-s",
|
132 |
"Ds3---p",
|
133 |
"Ds3---s",
|
|
|
134 |
"Ds3fp-s",
|
135 |
"Ds3fsos",
|
136 |
"Ds3fsrs",
|
@@ -159,18 +161,23 @@
|
|
159 |
"LSQR",
|
160 |
"LT",
|
161 |
"M",
|
162 |
-
"Mc",
|
163 |
"Mc-p-d",
|
164 |
"Mc-p-l",
|
|
|
|
|
|
|
165 |
"Mcfp-l",
|
166 |
"Mcfp-ln",
|
167 |
"Mcfprln",
|
168 |
"Mcfprly",
|
169 |
"Mcfsoln",
|
|
|
170 |
"Mcfsrln",
|
|
|
171 |
"Mcmp-l",
|
172 |
"Mcms-ln",
|
173 |
"Mcmsrl",
|
|
|
174 |
"Mcmsrly",
|
175 |
"Mffprln",
|
176 |
"Mffsrln",
|
@@ -243,7 +250,6 @@
|
|
243 |
"Pd3mpr--y",
|
244 |
"Pd3mso",
|
245 |
"Pd3msr",
|
246 |
-
"Pi3",
|
247 |
"Pi3--r",
|
248 |
"Pi3-po",
|
249 |
"Pi3-so",
|
@@ -289,6 +295,7 @@
|
|
289 |
"Pp3-po--------s",
|
290 |
"Pp3-sd--------w",
|
291 |
"Pp3-sd--y-----w",
|
|
|
292 |
"Pp3fpa--------w",
|
293 |
"Pp3fpa--y-----w",
|
294 |
"Pp3fpr--------s",
|
@@ -315,7 +322,6 @@
|
|
315 |
"Ps2fp-s",
|
316 |
"Ps2fsrp",
|
317 |
"Ps2fsrs",
|
318 |
-
"Ps2ms-s",
|
319 |
"Ps3---p",
|
320 |
"Ps3---s",
|
321 |
"Ps3fp-s",
|
@@ -348,7 +354,6 @@
|
|
348 |
"RPAR",
|
349 |
"RSQR",
|
350 |
"Rc",
|
351 |
-
"Rgc",
|
352 |
"Rgp",
|
353 |
"Rgpy",
|
354 |
"Rgs",
|
@@ -406,6 +411,7 @@
|
|
406 |
"Va--3s",
|
407 |
"Va--3s----y",
|
408 |
"Vag",
|
|
|
409 |
"Vaii1",
|
410 |
"Vaii2s",
|
411 |
"Vaii3p",
|
@@ -475,7 +481,7 @@
|
|
475 |
"Vmp--sm",
|
476 |
"Vmp--sm---y",
|
477 |
"Vmsp1p",
|
478 |
-
"
|
479 |
"Vmsp2s",
|
480 |
"Vmsp3",
|
481 |
"Vmsp3-----y",
|
@@ -488,6 +494,7 @@
|
|
488 |
"Ynmsoy",
|
489 |
"Ynmsry",
|
490 |
"Yp",
|
|
|
491 |
"Yp-sr",
|
492 |
"Yr"
|
493 |
],
|
@@ -525,14 +532,14 @@
|
|
525 |
"iobj",
|
526 |
"mark",
|
527 |
"nmod",
|
528 |
-
"nmod:agent",
|
529 |
-
"nmod:pmod",
|
530 |
"nmod:tmod",
|
531 |
"nsubj",
|
532 |
"nsubj:pass",
|
533 |
"nummod",
|
534 |
"obj",
|
535 |
"obl",
|
|
|
|
|
536 |
"orphan",
|
537 |
"parataxis",
|
538 |
"punct",
|
@@ -590,298 +597,195 @@
|
|
590 |
],
|
591 |
"performance":{
|
592 |
"token_acc":0.9990029326,
|
593 |
-
"
|
594 |
-
"
|
595 |
-
"
|
596 |
-
"
|
597 |
-
"
|
598 |
-
"
|
599 |
-
"
|
600 |
-
"
|
601 |
-
"
|
602 |
-
"speed":9573.3292142848,
|
603 |
-
"morph_per_feat":{
|
604 |
-
"Case":{
|
605 |
-
"p":0.985836417,
|
606 |
-
"r":0.9825766247,
|
607 |
-
"f":0.9842038217
|
608 |
-
},
|
609 |
-
"Gender":{
|
610 |
-
"p":0.9825625474,
|
611 |
-
"r":0.9721215152,
|
612 |
-
"f":0.9773141457
|
613 |
-
},
|
614 |
-
"Number":{
|
615 |
-
"p":0.9856352415,
|
616 |
-
"r":0.9757716212,
|
617 |
-
"f":0.98067863
|
618 |
-
},
|
619 |
-
"Person":{
|
620 |
-
"p":0.9789227166,
|
621 |
-
"r":0.9852681202,
|
622 |
-
"f":0.9820851689
|
623 |
-
},
|
624 |
-
"PronType":{
|
625 |
-
"p":0.9951523546,
|
626 |
-
"r":0.99308915,
|
627 |
-
"f":0.9941196818
|
628 |
-
},
|
629 |
-
"Polarity":{
|
630 |
-
"p":0.9918166939,
|
631 |
-
"r":0.993442623,
|
632 |
-
"f":0.9926289926
|
633 |
-
},
|
634 |
-
"AdpType":{
|
635 |
-
"p":0.9976311337,
|
636 |
-
"r":0.9976311337,
|
637 |
-
"f":0.9976311337
|
638 |
-
},
|
639 |
-
"Definite":{
|
640 |
-
"p":0.9854908915,
|
641 |
-
"r":0.9703174603,
|
642 |
-
"f":0.9778453171
|
643 |
-
},
|
644 |
-
"Degree":{
|
645 |
-
"p":0.9436038514,
|
646 |
-
"r":0.9214237743,
|
647 |
-
"f":0.9323819232
|
648 |
-
},
|
649 |
-
"VerbForm":{
|
650 |
-
"p":0.9632545932,
|
651 |
-
"r":0.9760638298,
|
652 |
-
"f":0.9696169089
|
653 |
-
},
|
654 |
-
"Abbr":{
|
655 |
-
"p":0.9804878049,
|
656 |
-
"r":0.8973214286,
|
657 |
-
"f":0.9370629371
|
658 |
-
},
|
659 |
-
"Poss":{
|
660 |
-
"p":1.0,
|
661 |
-
"r":0.9951807229,
|
662 |
-
"f":0.9975845411
|
663 |
-
},
|
664 |
-
"NumForm":{
|
665 |
-
"p":0.9944134078,
|
666 |
-
"r":0.9807162534,
|
667 |
-
"f":0.987517337
|
668 |
-
},
|
669 |
-
"NumType":{
|
670 |
-
"p":0.995821727,
|
671 |
-
"r":0.9821428571,
|
672 |
-
"f":0.9889349931
|
673 |
-
},
|
674 |
-
"Reflex":{
|
675 |
-
"p":1.0,
|
676 |
-
"r":0.9935897436,
|
677 |
-
"f":0.9967845659
|
678 |
-
},
|
679 |
-
"Strength":{
|
680 |
-
"p":0.9841269841,
|
681 |
-
"r":0.9841269841,
|
682 |
-
"f":0.9841269841
|
683 |
-
},
|
684 |
-
"Mood":{
|
685 |
-
"p":0.9446428571,
|
686 |
-
"r":0.9706422018,
|
687 |
-
"f":0.9574660633
|
688 |
-
},
|
689 |
-
"Tense":{
|
690 |
-
"p":0.9470672389,
|
691 |
-
"r":0.9622093023,
|
692 |
-
"f":0.9545782264
|
693 |
-
},
|
694 |
-
"Variant":{
|
695 |
-
"p":0.9867549669,
|
696 |
-
"r":0.9612903226,
|
697 |
-
"f":0.9738562092
|
698 |
-
},
|
699 |
-
"Position":{
|
700 |
-
"p":1.0,
|
701 |
-
"r":0.9910714286,
|
702 |
-
"f":0.9955156951
|
703 |
-
},
|
704 |
-
"Number[psor]":{
|
705 |
-
"p":1.0,
|
706 |
-
"r":0.9666666667,
|
707 |
-
"f":0.9830508475
|
708 |
-
},
|
709 |
-
"PartType":{
|
710 |
-
"p":1.0,
|
711 |
-
"r":0.9459459459,
|
712 |
-
"f":0.9722222222
|
713 |
-
},
|
714 |
-
"Foreign":{
|
715 |
-
"p":0.0,
|
716 |
-
"r":0.0,
|
717 |
-
"f":0.0
|
718 |
-
}
|
719 |
-
},
|
720 |
"dep_las_per_type":{
|
721 |
"root":{
|
722 |
-
"p":0.
|
723 |
-
"r":0.
|
724 |
-
"f":0.
|
725 |
},
|
726 |
"mark":{
|
727 |
-
"p":0.
|
728 |
-
"r":0.
|
729 |
-
"f":0.
|
730 |
},
|
731 |
"case":{
|
732 |
-
"p":0.
|
733 |
-
"r":0.
|
734 |
-
"f":0.
|
735 |
},
|
736 |
"nmod:tmod":{
|
737 |
-
"p":0.
|
738 |
-
"r":0.
|
739 |
-
"f":0.
|
740 |
},
|
741 |
"amod":{
|
742 |
-
"p":0.
|
743 |
-
"r":0.
|
744 |
-
"f":0.
|
745 |
},
|
746 |
"nsubj":{
|
747 |
-
"p":0.
|
748 |
"r":0.7914691943,
|
749 |
-
"f":0.
|
750 |
},
|
751 |
"nmod":{
|
752 |
-
"p":0.
|
753 |
-
"r":0.
|
754 |
-
"f":0.
|
755 |
},
|
756 |
"aux":{
|
757 |
-
"p":0.
|
758 |
-
"r":0.
|
759 |
-
"f":0.
|
760 |
},
|
761 |
"advcl":{
|
762 |
-
"p":0.
|
763 |
-
"r":0.
|
764 |
-
"f":0.
|
765 |
},
|
766 |
"obj":{
|
767 |
-
"p":0.
|
768 |
-
"r":0.
|
769 |
-
"f":0.
|
770 |
},
|
771 |
"det":{
|
772 |
-
"p":0.
|
773 |
-
"r":0.
|
774 |
-
"f":0.
|
775 |
},
|
776 |
"cc":{
|
777 |
-
"p":0.
|
778 |
-
"r":0.
|
779 |
-
"f":0.
|
780 |
},
|
781 |
"conj":{
|
782 |
-
"p":0.
|
783 |
-
"r":0.
|
784 |
-
"f":0.
|
785 |
},
|
786 |
"nummod":{
|
787 |
-
"p":0.
|
788 |
-
"r":0.
|
789 |
-
"f":0.
|
790 |
},
|
791 |
"acl":{
|
792 |
-
"p":0.
|
793 |
-
"r":0.
|
794 |
-
"f":0.
|
795 |
},
|
796 |
"advmod":{
|
797 |
-
"p":0.
|
798 |
-
"r":0.
|
799 |
-
"f":0.
|
800 |
},
|
801 |
"obl":{
|
802 |
-
"p":0.
|
803 |
-
"r":0.
|
804 |
-
"f":0.
|
805 |
},
|
806 |
"expl:pass":{
|
807 |
-
"p":0.
|
808 |
-
"r":0.
|
809 |
-
"f":0.
|
810 |
},
|
811 |
"nsubj:pass":{
|
812 |
-
"p":0.
|
813 |
-
"r":0.
|
814 |
-
"f":0.
|
815 |
},
|
816 |
"fixed":{
|
817 |
-
"p":0.
|
818 |
-
"r":0.
|
819 |
-
"f":0.
|
820 |
},
|
821 |
"appos":{
|
822 |
-
"p":0.
|
823 |
"r":0.393129771,
|
824 |
-
"f":0.
|
825 |
},
|
826 |
"parataxis":{
|
827 |
-
"p":0.
|
828 |
-
"r":0.
|
829 |
-
"f":0.
|
830 |
},
|
831 |
"aux:pass":{
|
832 |
-
"p":0.
|
833 |
"r":0.9133333333,
|
834 |
-
"f":0.
|
835 |
},
|
836 |
"nmod:agent":{
|
837 |
-
"p":0.
|
838 |
-
"r":0.
|
839 |
-
"f":0.
|
840 |
},
|
841 |
"ccomp":{
|
842 |
-
"p":0.
|
843 |
-
"r":0.
|
844 |
-
"f":0.
|
845 |
},
|
846 |
"nmod:pmod":{
|
847 |
-
"p":0.
|
848 |
-
"r":0.
|
849 |
-
"f":0.
|
850 |
},
|
851 |
"iobj":{
|
852 |
-
"p":0.
|
853 |
-
"r":0.
|
854 |
-
"f":0.
|
855 |
},
|
856 |
"flat":{
|
857 |
-
"p":0.
|
858 |
-
"r":0.
|
859 |
-
"f":0.
|
860 |
},
|
861 |
"cop":{
|
862 |
-
"p":0.
|
863 |
-
"r":0.
|
864 |
-
"f":0.
|
865 |
},
|
866 |
"csubj":{
|
867 |
-
"p":0
|
868 |
-
"r":0.
|
869 |
-
"f":0.
|
870 |
},
|
871 |
"dep":{
|
872 |
"p":0.0,
|
873 |
"r":0.0,
|
874 |
"f":0.0
|
875 |
},
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
876 |
"expl:pv":{
|
877 |
-
"p":0.
|
878 |
-
"r":0.
|
879 |
-
"f":0.
|
880 |
},
|
881 |
"expl":{
|
882 |
-
"p":0.
|
883 |
-
"r":0.
|
884 |
-
"f":0.
|
885 |
},
|
886 |
"vocative":{
|
887 |
"p":0.0,
|
@@ -889,141 +793,260 @@
|
|
889 |
"f":0.0
|
890 |
},
|
891 |
"expl:poss":{
|
892 |
-
"p":0
|
893 |
-
"r":0.
|
894 |
-
"f":0.
|
895 |
},
|
896 |
"goeswith":{
|
897 |
-
"p":0.1,
|
898 |
-
"r":0.3333333333,
|
899 |
-
"f":0.1538461538
|
900 |
-
},
|
901 |
-
"xcomp":{
|
902 |
-
"p":0.325,
|
903 |
-
"r":0.4814814815,
|
904 |
-
"f":0.3880597015
|
905 |
-
},
|
906 |
-
"compound":{
|
907 |
-
"p":0.3333333333,
|
908 |
-
"r":0.5714285714,
|
909 |
-
"f":0.4210526316
|
910 |
-
},
|
911 |
-
"ccomp:pmod":{
|
912 |
"p":0.0,
|
913 |
"r":0.0,
|
914 |
"f":0.0
|
915 |
},
|
|
|
|
|
|
|
|
|
|
|
916 |
"orphan":{
|
917 |
"p":0.0,
|
918 |
"r":0.0,
|
919 |
"f":0.0
|
920 |
},
|
921 |
"expl:impers":{
|
922 |
-
"p":0.
|
923 |
-
"r":0.
|
924 |
-
"f":0.
|
|
|
|
|
|
|
|
|
|
|
925 |
},
|
926 |
"list":{
|
927 |
"p":0.0,
|
928 |
"r":0.0,
|
929 |
"f":0.0
|
930 |
},
|
|
|
|
|
|
|
|
|
|
|
931 |
"cc:preconj":{
|
932 |
"p":0.0,
|
933 |
"r":0.0,
|
934 |
"f":0.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
935 |
},
|
936 |
-
"
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
937 |
"p":0.0,
|
938 |
"r":0.0,
|
939 |
"f":0.0
|
940 |
}
|
941 |
},
|
942 |
-
"
|
943 |
-
"
|
944 |
-
"
|
|
|
945 |
"ents_per_type":{
|
946 |
"DATETIME":{
|
947 |
-
"p":0.
|
948 |
-
"r":0.
|
949 |
-
"f":0.
|
950 |
},
|
951 |
"ORGANIZATION":{
|
952 |
-
"p":0.
|
953 |
-
"r":0.
|
954 |
-
"f":0.
|
955 |
},
|
956 |
"FACILITY":{
|
957 |
-
"p":0.
|
958 |
"r":0.4045801527,
|
959 |
-
"f":0.
|
960 |
},
|
961 |
"NUMERIC_VALUE":{
|
962 |
-
"p":0.
|
963 |
-
"r":0.
|
964 |
-
"f":0.
|
965 |
},
|
966 |
"ORDINAL":{
|
967 |
-
"p":0.
|
968 |
-
"r":0.
|
969 |
-
"f":0.
|
970 |
},
|
971 |
"EVENT":{
|
972 |
-
"p":0.
|
973 |
-
"r":0.
|
974 |
-
"f":0.
|
975 |
},
|
976 |
"GPE":{
|
977 |
-
"p":0.
|
978 |
-
"r":0.
|
979 |
-
"f":0.
|
980 |
},
|
981 |
"PERSON":{
|
982 |
-
"p":0.
|
983 |
-
"r":0.
|
984 |
-
"f":0.
|
985 |
},
|
986 |
"NAT_REL_POL":{
|
987 |
-
"p":0.
|
988 |
-
"r":0.
|
989 |
-
"f":0.
|
990 |
},
|
991 |
"MONEY":{
|
992 |
-
"p":0.
|
993 |
-
"r":0.
|
994 |
-
"f":0.
|
995 |
},
|
996 |
"PRODUCT":{
|
997 |
-
"p":0.
|
998 |
-
"r":0.
|
999 |
-
"f":0.
|
1000 |
},
|
1001 |
"LOC":{
|
1002 |
-
"p":0.
|
1003 |
-
"r":0.
|
1004 |
-
"f":0.
|
1005 |
},
|
1006 |
"WORK_OF_ART":{
|
1007 |
-
"p":0.
|
1008 |
-
"r":0.
|
1009 |
-
"f":0.
|
1010 |
},
|
1011 |
"QUANTITY":{
|
1012 |
-
"p":0.
|
1013 |
"r":0.8846153846,
|
1014 |
-
"f":0.
|
1015 |
},
|
1016 |
"LANGUAGE":{
|
1017 |
-
"p":0.
|
1018 |
-
"r":
|
1019 |
-
"f":0.
|
1020 |
},
|
1021 |
"PERIOD":{
|
1022 |
-
"p":0.
|
1023 |
-
"r":0.
|
1024 |
-
"f":0.
|
1025 |
}
|
1026 |
-
}
|
|
|
1027 |
},
|
1028 |
"sources":[
|
1029 |
{
|
@@ -1033,7 +1056,7 @@
|
|
1033 |
"author":"Michal M\u011bchura"
|
1034 |
},
|
1035 |
{
|
1036 |
-
"name":"UD Romanian RRT v2.
|
1037 |
"url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
|
1038 |
"license":"CC BY-SA 4.0",
|
1039 |
"author":"Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin"
|
|
|
1 |
{
|
2 |
"lang":"ro",
|
3 |
"name":"core_news_sm",
|
4 |
+
"version":"3.2.0",
|
5 |
"description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
|
6 |
"author":"Explosion",
|
7 |
"email":"[email protected]",
|
8 |
"url":"https://explosion.ai",
|
9 |
"license":"CC BY-SA 4.0",
|
10 |
+
"spacy_version":">=3.2.0,<3.3.0",
|
11 |
+
"spacy_git_version":"bb26550e2",
|
12 |
"vectors":{
|
13 |
"width":0,
|
14 |
"vectors":0,
|
|
|
30 |
"Afp",
|
31 |
"Afp-p-n",
|
32 |
"Afp-poy",
|
33 |
+
"Afp-srn",
|
34 |
"Afpf--n",
|
35 |
"Afpfp-n",
|
36 |
"Afpfp-ny",
|
|
|
132 |
"Ds2ms-s",
|
133 |
"Ds3---p",
|
134 |
"Ds3---s",
|
135 |
+
"Ds3---sy",
|
136 |
"Ds3fp-s",
|
137 |
"Ds3fsos",
|
138 |
"Ds3fsrs",
|
|
|
161 |
"LSQR",
|
162 |
"LT",
|
163 |
"M",
|
|
|
164 |
"Mc-p-d",
|
165 |
"Mc-p-l",
|
166 |
+
"Mc-s-b",
|
167 |
+
"Mc-s-d",
|
168 |
+
"Mc-s-l",
|
169 |
"Mcfp-l",
|
170 |
"Mcfp-ln",
|
171 |
"Mcfprln",
|
172 |
"Mcfprly",
|
173 |
"Mcfsoln",
|
174 |
+
"Mcfsrl",
|
175 |
"Mcfsrln",
|
176 |
+
"Mcfsrly",
|
177 |
"Mcmp-l",
|
178 |
"Mcms-ln",
|
179 |
"Mcmsrl",
|
180 |
+
"Mcmsrln",
|
181 |
"Mcmsrly",
|
182 |
"Mffprln",
|
183 |
"Mffsrln",
|
|
|
250 |
"Pd3mpr--y",
|
251 |
"Pd3mso",
|
252 |
"Pd3msr",
|
|
|
253 |
"Pi3--r",
|
254 |
"Pi3-po",
|
255 |
"Pi3-so",
|
|
|
295 |
"Pp3-po--------s",
|
296 |
"Pp3-sd--------w",
|
297 |
"Pp3-sd--y-----w",
|
298 |
+
"Pp3-so--------s",
|
299 |
"Pp3fpa--------w",
|
300 |
"Pp3fpa--y-----w",
|
301 |
"Pp3fpr--------s",
|
|
|
322 |
"Ps2fp-s",
|
323 |
"Ps2fsrp",
|
324 |
"Ps2fsrs",
|
|
|
325 |
"Ps3---p",
|
326 |
"Ps3---s",
|
327 |
"Ps3fp-s",
|
|
|
354 |
"RPAR",
|
355 |
"RSQR",
|
356 |
"Rc",
|
|
|
357 |
"Rgp",
|
358 |
"Rgpy",
|
359 |
"Rgs",
|
|
|
411 |
"Va--3s",
|
412 |
"Va--3s----y",
|
413 |
"Vag",
|
414 |
+
"Vag-------y",
|
415 |
"Vaii1",
|
416 |
"Vaii2s",
|
417 |
"Vaii3p",
|
|
|
481 |
"Vmp--sm",
|
482 |
"Vmp--sm---y",
|
483 |
"Vmsp1p",
|
484 |
+
"Vmsp2p",
|
485 |
"Vmsp2s",
|
486 |
"Vmsp3",
|
487 |
"Vmsp3-----y",
|
|
|
494 |
"Ynmsoy",
|
495 |
"Ynmsry",
|
496 |
"Yp",
|
497 |
+
"Yp,Yn",
|
498 |
"Yp-sr",
|
499 |
"Yr"
|
500 |
],
|
|
|
532 |
"iobj",
|
533 |
"mark",
|
534 |
"nmod",
|
|
|
|
|
535 |
"nmod:tmod",
|
536 |
"nsubj",
|
537 |
"nsubj:pass",
|
538 |
"nummod",
|
539 |
"obj",
|
540 |
"obl",
|
541 |
+
"obl:agent",
|
542 |
+
"obl:pmod",
|
543 |
"orphan",
|
544 |
"parataxis",
|
545 |
"punct",
|
|
|
597 |
],
|
598 |
"performance":{
|
599 |
"token_acc":0.9990029326,
|
600 |
+
"token_p":0.9967350492,
|
601 |
+
"token_r":0.9957244934,
|
602 |
+
"token_f":0.9959492157,
|
603 |
+
"tag_acc":0.9540563521,
|
604 |
+
"sents_p":0.9679572764,
|
605 |
+
"sents_r":0.9640957447,
|
606 |
+
"sents_f":0.9660226516,
|
607 |
+
"dep_uas":0.8691229961,
|
608 |
+
"dep_las":0.8114296468,
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
609 |
"dep_las_per_type":{
|
610 |
"root":{
|
611 |
+
"p":0.8723021583,
|
612 |
+
"r":0.9133709981,
|
613 |
+
"f":0.8923643054
|
614 |
},
|
615 |
"mark":{
|
616 |
+
"p":0.9135338346,
|
617 |
+
"r":0.9169811321,
|
618 |
+
"f":0.9152542373
|
619 |
},
|
620 |
"case":{
|
621 |
+
"p":0.9534009771,
|
622 |
+
"r":0.951256093,
|
623 |
+
"f":0.9523273273
|
624 |
},
|
625 |
"nmod:tmod":{
|
626 |
+
"p":0.6888888889,
|
627 |
+
"r":0.2605042017,
|
628 |
+
"f":0.3780487805
|
629 |
},
|
630 |
"amod":{
|
631 |
+
"p":0.88,
|
632 |
+
"r":0.8807495741,
|
633 |
+
"f":0.8803746275
|
634 |
},
|
635 |
"nsubj":{
|
636 |
+
"p":0.835,
|
637 |
"r":0.7914691943,
|
638 |
+
"f":0.8126520681
|
639 |
},
|
640 |
"nmod":{
|
641 |
+
"p":0.7983435362,
|
642 |
+
"r":0.7915030346,
|
643 |
+
"f":0.7949085694
|
644 |
},
|
645 |
"aux":{
|
646 |
+
"p":0.9665427509,
|
647 |
+
"r":0.9506398537,
|
648 |
+
"f":0.9585253456
|
649 |
},
|
650 |
"advcl":{
|
651 |
+
"p":0.5862068966,
|
652 |
+
"r":0.6390977444,
|
653 |
+
"f":0.6115107914
|
654 |
},
|
655 |
"obj":{
|
656 |
+
"p":0.800443459,
|
657 |
+
"r":0.8337182448,
|
658 |
+
"f":0.8167420814
|
659 |
},
|
660 |
"det":{
|
661 |
+
"p":0.9417142857,
|
662 |
+
"r":0.933182333,
|
663 |
+
"f":0.9374288965
|
664 |
},
|
665 |
"cc":{
|
666 |
+
"p":0.9206680585,
|
667 |
+
"r":0.9206680585,
|
668 |
+
"f":0.9206680585
|
669 |
},
|
670 |
"conj":{
|
671 |
+
"p":0.5717948718,
|
672 |
+
"r":0.516801854,
|
673 |
+
"f":0.5429093122
|
674 |
},
|
675 |
"nummod":{
|
676 |
+
"p":0.8808777429,
|
677 |
+
"r":0.8726708075,
|
678 |
+
"f":0.8767550702
|
679 |
},
|
680 |
"acl":{
|
681 |
+
"p":0.7312138728,
|
682 |
+
"r":0.65374677,
|
683 |
+
"f":0.690313779
|
684 |
},
|
685 |
"advmod":{
|
686 |
+
"p":0.774535809,
|
687 |
+
"r":0.7704485488,
|
688 |
+
"f":0.7724867725
|
689 |
},
|
690 |
"obl":{
|
691 |
+
"p":0.6485042735,
|
692 |
+
"r":0.7703045685,
|
693 |
+
"f":0.7041763341
|
694 |
},
|
695 |
"expl:pass":{
|
696 |
+
"p":0.8367346939,
|
697 |
+
"r":0.7592592593,
|
698 |
+
"f":0.7961165049
|
699 |
},
|
700 |
"nsubj:pass":{
|
701 |
+
"p":0.8206896552,
|
702 |
+
"r":0.7256097561,
|
703 |
+
"f":0.7702265372
|
704 |
},
|
705 |
"fixed":{
|
706 |
+
"p":0.867803838,
|
707 |
+
"r":0.8604651163,
|
708 |
+
"f":0.864118896
|
709 |
},
|
710 |
"appos":{
|
711 |
+
"p":0.4136546185,
|
712 |
"r":0.393129771,
|
713 |
+
"f":0.4031311155
|
714 |
},
|
715 |
"parataxis":{
|
716 |
+
"p":0.2272727273,
|
717 |
+
"r":0.2857142857,
|
718 |
+
"f":0.253164557
|
719 |
},
|
720 |
"aux:pass":{
|
721 |
+
"p":0.9133333333,
|
722 |
"r":0.9133333333,
|
723 |
+
"f":0.9133333333
|
724 |
},
|
725 |
"nmod:agent":{
|
726 |
+
"p":0.0,
|
727 |
+
"r":0.0,
|
728 |
+
"f":0.0
|
729 |
},
|
730 |
"ccomp":{
|
731 |
+
"p":0.8548387097,
|
732 |
+
"r":0.8217054264,
|
733 |
+
"f":0.837944664
|
734 |
},
|
735 |
"nmod:pmod":{
|
736 |
+
"p":0.0,
|
737 |
+
"r":0.0,
|
738 |
+
"f":0.0
|
739 |
},
|
740 |
"iobj":{
|
741 |
+
"p":0.6842105263,
|
742 |
+
"r":0.6419753086,
|
743 |
+
"f":0.6624203822
|
744 |
},
|
745 |
"flat":{
|
746 |
+
"p":0.7493333333,
|
747 |
+
"r":0.7394736842,
|
748 |
+
"f":0.7443708609
|
749 |
},
|
750 |
"cop":{
|
751 |
+
"p":0.8034188034,
|
752 |
+
"r":0.7580645161,
|
753 |
+
"f":0.7800829876
|
754 |
},
|
755 |
"csubj":{
|
756 |
+
"p":1.0,
|
757 |
+
"r":0.8095238095,
|
758 |
+
"f":0.8947368421
|
759 |
},
|
760 |
"dep":{
|
761 |
"p":0.0,
|
762 |
"r":0.0,
|
763 |
"f":0.0
|
764 |
},
|
765 |
+
"obl:agent":{
|
766 |
+
"p":0.0,
|
767 |
+
"r":0.0,
|
768 |
+
"f":0.0
|
769 |
+
},
|
770 |
+
"xcomp":{
|
771 |
+
"p":0.3658536585,
|
772 |
+
"r":0.5555555556,
|
773 |
+
"f":0.4411764706
|
774 |
+
},
|
775 |
+
"obl:pmod":{
|
776 |
+
"p":0.0,
|
777 |
+
"r":0.0,
|
778 |
+
"f":0.0
|
779 |
+
},
|
780 |
"expl:pv":{
|
781 |
+
"p":0.7808219178,
|
782 |
+
"r":0.8260869565,
|
783 |
+
"f":0.8028169014
|
784 |
},
|
785 |
"expl":{
|
786 |
+
"p":0.696969697,
|
787 |
+
"r":0.8518518519,
|
788 |
+
"f":0.7666666667
|
789 |
},
|
790 |
"vocative":{
|
791 |
"p":0.0,
|
|
|
793 |
"f":0.0
|
794 |
},
|
795 |
"expl:poss":{
|
796 |
+
"p":1.0,
|
797 |
+
"r":0.9677419355,
|
798 |
+
"f":0.9836065574
|
799 |
},
|
800 |
"goeswith":{
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
801 |
"p":0.0,
|
802 |
"r":0.0,
|
803 |
"f":0.0
|
804 |
},
|
805 |
+
"ccomp:pmod":{
|
806 |
+
"p":0.0909090909,
|
807 |
+
"r":0.3333333333,
|
808 |
+
"f":0.1428571429
|
809 |
+
},
|
810 |
"orphan":{
|
811 |
"p":0.0,
|
812 |
"r":0.0,
|
813 |
"f":0.0
|
814 |
},
|
815 |
"expl:impers":{
|
816 |
+
"p":0.0,
|
817 |
+
"r":0.0,
|
818 |
+
"f":0.0
|
819 |
+
},
|
820 |
+
"compound":{
|
821 |
+
"p":0.75,
|
822 |
+
"r":0.4285714286,
|
823 |
+
"f":0.5454545455
|
824 |
},
|
825 |
"list":{
|
826 |
"p":0.0,
|
827 |
"r":0.0,
|
828 |
"f":0.0
|
829 |
},
|
830 |
+
"csubj:pass":{
|
831 |
+
"p":0.0,
|
832 |
+
"r":0.0,
|
833 |
+
"f":0.0
|
834 |
+
},
|
835 |
"cc:preconj":{
|
836 |
"p":0.0,
|
837 |
"r":0.0,
|
838 |
"f":0.0
|
839 |
+
}
|
840 |
+
},
|
841 |
+
"pos_acc":0.9297553275,
|
842 |
+
"morph_acc":0.9386929369,
|
843 |
+
"morph_micro_p":0.9832628288,
|
844 |
+
"morph_micro_r":0.9507350539,
|
845 |
+
"morph_micro_f":0.9636742939,
|
846 |
+
"morph_per_feat":{
|
847 |
+
"Case":{
|
848 |
+
"p":0.9877237852,
|
849 |
+
"r":0.9823222689,
|
850 |
+
"f":0.985015622
|
851 |
},
|
852 |
+
"Gender":{
|
853 |
+
"p":0.9819398838,
|
854 |
+
"r":0.9719964996,
|
855 |
+
"f":0.9769428912
|
856 |
+
},
|
857 |
+
"Number":{
|
858 |
+
"p":0.9836679143,
|
859 |
+
"r":0.9136205625,
|
860 |
+
"f":0.9473511742
|
861 |
+
},
|
862 |
+
"Person":{
|
863 |
+
"p":0.982373678,
|
864 |
+
"r":0.9852681202,
|
865 |
+
"f":0.9838187702
|
866 |
+
},
|
867 |
+
"PronType":{
|
868 |
+
"p":0.9937629938,
|
869 |
+
"r":0.991015895,
|
870 |
+
"f":0.9923875433
|
871 |
+
},
|
872 |
+
"Polarity":{
|
873 |
+
"p":0.9918433931,
|
874 |
+
"r":0.9967213115,
|
875 |
+
"f":0.9942763696
|
876 |
+
},
|
877 |
+
"AdpType":{
|
878 |
+
"p":0.999660787,
|
879 |
+
"r":0.9972927242,
|
880 |
+
"f":0.9984753515
|
881 |
+
},
|
882 |
+
"Definite":{
|
883 |
+
"p":0.9843724827,
|
884 |
+
"r":0.9698412698,
|
885 |
+
"f":0.9770528504
|
886 |
+
},
|
887 |
+
"Degree":{
|
888 |
+
"p":0.9400413508,
|
889 |
+
"r":0.916051041,
|
890 |
+
"f":0.9278911565
|
891 |
+
},
|
892 |
+
"VerbForm":{
|
893 |
+
"p":0.9644736842,
|
894 |
+
"r":0.9747340426,
|
895 |
+
"f":0.9695767196
|
896 |
+
},
|
897 |
+
"Abbr":{
|
898 |
+
"p":0.9754901961,
|
899 |
+
"r":0.8883928571,
|
900 |
+
"f":0.9299065421
|
901 |
+
},
|
902 |
+
"Poss":{
|
903 |
+
"p":1.0,
|
904 |
+
"r":0.9855421687,
|
905 |
+
"f":0.9927184466
|
906 |
+
},
|
907 |
+
"NumForm":{
|
908 |
+
"p":1.0,
|
909 |
+
"r":0.3223140496,
|
910 |
+
"f":0.4875
|
911 |
+
},
|
912 |
+
"NumType":{
|
913 |
+
"p":1.0,
|
914 |
+
"r":0.3241758242,
|
915 |
+
"f":0.489626556
|
916 |
+
},
|
917 |
+
"Reflex":{
|
918 |
+
"p":1.0,
|
919 |
+
"r":0.9935897436,
|
920 |
+
"f":0.9967845659
|
921 |
+
},
|
922 |
+
"Strength":{
|
923 |
+
"p":0.9919354839,
|
924 |
+
"r":0.9761904762,
|
925 |
+
"f":0.984
|
926 |
+
},
|
927 |
+
"Mood":{
|
928 |
+
"p":0.9584086799,
|
929 |
+
"r":0.9724770642,
|
930 |
+
"f":0.9653916211
|
931 |
+
},
|
932 |
+
"Tense":{
|
933 |
+
"p":0.9596541787,
|
934 |
+
"r":0.9680232558,
|
935 |
+
"f":0.9638205499
|
936 |
+
},
|
937 |
+
"Variant":{
|
938 |
+
"p":0.9865771812,
|
939 |
+
"r":0.9483870968,
|
940 |
+
"f":0.9671052632
|
941 |
+
},
|
942 |
+
"Position":{
|
943 |
+
"p":0.9910714286,
|
944 |
+
"r":0.9910714286,
|
945 |
+
"f":0.9910714286
|
946 |
+
},
|
947 |
+
"Number[psor]":{
|
948 |
+
"p":1.0,
|
949 |
+
"r":0.9,
|
950 |
+
"f":0.9473684211
|
951 |
+
},
|
952 |
+
"PartType":{
|
953 |
+
"p":0.9722222222,
|
954 |
+
"r":0.9459459459,
|
955 |
+
"f":0.9589041096
|
956 |
+
},
|
957 |
+
"Foreign":{
|
958 |
"p":0.0,
|
959 |
"r":0.0,
|
960 |
"f":0.0
|
961 |
}
|
962 |
},
|
963 |
+
"lemma_acc":0.8183070924,
|
964 |
+
"ents_p":0.7193937038,
|
965 |
+
"ents_r":0.711102574,
|
966 |
+
"ents_f":0.7152241113,
|
967 |
"ents_per_type":{
|
968 |
"DATETIME":{
|
969 |
+
"p":0.7883959044,
|
970 |
+
"r":0.8048780488,
|
971 |
+
"f":0.7965517241
|
972 |
},
|
973 |
"ORGANIZATION":{
|
974 |
+
"p":0.6325301205,
|
975 |
+
"r":0.6687898089,
|
976 |
+
"f":0.6501547988
|
977 |
},
|
978 |
"FACILITY":{
|
979 |
+
"p":0.4862385321,
|
980 |
"r":0.4045801527,
|
981 |
+
"f":0.4416666667
|
982 |
},
|
983 |
"NUMERIC_VALUE":{
|
984 |
+
"p":0.9227467811,
|
985 |
+
"r":0.9110169492,
|
986 |
+
"f":0.9168443497
|
987 |
},
|
988 |
"ORDINAL":{
|
989 |
+
"p":0.7857142857,
|
990 |
+
"r":0.8,
|
991 |
+
"f":0.7927927928
|
992 |
},
|
993 |
"EVENT":{
|
994 |
+
"p":0.5294117647,
|
995 |
+
"r":0.4864864865,
|
996 |
+
"f":0.5070422535
|
997 |
},
|
998 |
"GPE":{
|
999 |
+
"p":0.7953586498,
|
1000 |
+
"r":0.8666666667,
|
1001 |
+
"f":0.8294829483
|
1002 |
},
|
1003 |
"PERSON":{
|
1004 |
+
"p":0.6728187919,
|
1005 |
+
"r":0.6728187919,
|
1006 |
+
"f":0.6728187919
|
1007 |
},
|
1008 |
"NAT_REL_POL":{
|
1009 |
+
"p":0.8854961832,
|
1010 |
+
"r":0.7733333333,
|
1011 |
+
"f":0.8256227758
|
1012 |
},
|
1013 |
"MONEY":{
|
1014 |
+
"p":0.9230769231,
|
1015 |
+
"r":0.8275862069,
|
1016 |
+
"f":0.8727272727
|
1017 |
},
|
1018 |
"PRODUCT":{
|
1019 |
+
"p":0.427184466,
|
1020 |
+
"r":0.3211678832,
|
1021 |
+
"f":0.3666666667
|
1022 |
},
|
1023 |
"LOC":{
|
1024 |
+
"p":0.4782608696,
|
1025 |
+
"r":0.4342105263,
|
1026 |
+
"f":0.4551724138
|
1027 |
},
|
1028 |
"WORK_OF_ART":{
|
1029 |
+
"p":0.2222222222,
|
1030 |
+
"r":0.2105263158,
|
1031 |
+
"f":0.2162162162
|
1032 |
},
|
1033 |
"QUANTITY":{
|
1034 |
+
"p":0.7419354839,
|
1035 |
"r":0.8846153846,
|
1036 |
+
"f":0.8070175439
|
1037 |
},
|
1038 |
"LANGUAGE":{
|
1039 |
+
"p":0.6,
|
1040 |
+
"r":0.75,
|
1041 |
+
"f":0.6666666667
|
1042 |
},
|
1043 |
"PERIOD":{
|
1044 |
+
"p":0.8378378378,
|
1045 |
+
"r":0.7380952381,
|
1046 |
+
"f":0.7848101266
|
1047 |
}
|
1048 |
+
},
|
1049 |
+
"speed":8496.149813704
|
1050 |
},
|
1051 |
"sources":[
|
1052 |
{
|
|
|
1056 |
"author":"Michal M\u011bchura"
|
1057 |
},
|
1058 |
{
|
1059 |
+
"name":"UD Romanian RRT v2.8",
|
1060 |
"url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
|
1061 |
"license":"CC BY-SA 4.0",
|
1062 |
"author":"Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin"
|
ner/model
CHANGED
Binary files a/ner/model and b/ner/model differ
|
|
parser/model
CHANGED
Binary files a/parser/model and b/parser/model differ
|
|
parser/moves
CHANGED
@@ -1 +1 @@
|
|
1 |
-
��moves
|
|
|
1 |
+
��moves�{"0":{"":86134},"1":{"":90421},"2":{"case":22293,"punct":9078,"det":9035,"nsubj":7080,"advmod":6417,"mark":5380,"cc":5367,"aux":4002,"obl":2028,"nummod":1887,"expl:pv":1796,"cop":1712,"aux:pass":1372,"amod":1370,"nsubj:pass":1013,"expl:pass":910,"parataxis":878,"obj":868,"advcl":713,"iobj":564,"expl:poss":469,"expl":393,"nmod":203,"nsubj||csubj":155,"nmod:tmod":153,"expl:impers":102,"xcomp":97,"advmod:tmod":84,"obl:pmod":74,"cc:preconj":63,"csubj":59,"nsubj:pass||csubj":57,"obj||ccomp":45,"orphan":32,"advcl:tcl":30,"dep":0},"3":{"nmod":16696,"punct":14500,"amod":9699,"obl":7775,"conj":7286,"fixed":5485,"obj":5462,"acl":4105,"advmod":2099,"advcl":2049,"ccomp":1932,"nummod":1667,"nsubj":1280,"obl:pmod":1208,"flat":1167,"det":1035,"appos":915,"xcomp":891,"iobj":803,"obl:agent":719,"csubj":632,"nsubj:pass":554,"parataxis":435,"case":434,"nmod:tmod":283,"ccomp:pmod":178,"cc":123,"cop":100,"expl:pv":86,"goeswith":72,"expl":55,"compound":52,"advcl:tcl":52,"csubj:pass":49,"expl:poss":35,"vocative":31,"dep":0},"4":{"ROOT":8021}}�cfg��neg_key�
|
ro_core_news_sm-any-py3-none-any.whl
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:dddd9698256565d7ec89658ea742ff738136afa01a33b81d67b1f2df4519459e
|
3 |
+
size 16645399
|
senter/cfg
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
{
|
2 |
-
|
3 |
}
|
|
|
1 |
{
|
2 |
+
"overwrite":false
|
3 |
}
|
senter/model
CHANGED
Binary files a/senter/model and b/senter/model differ
|
|
tagger/cfg
CHANGED
@@ -10,6 +10,7 @@
|
|
10 |
"Afp",
|
11 |
"Afp-p-n",
|
12 |
"Afp-poy",
|
|
|
13 |
"Afpf--n",
|
14 |
"Afpfp-n",
|
15 |
"Afpfp-ny",
|
@@ -111,6 +112,7 @@
|
|
111 |
"Ds2ms-s",
|
112 |
"Ds3---p",
|
113 |
"Ds3---s",
|
|
|
114 |
"Ds3fp-s",
|
115 |
"Ds3fsos",
|
116 |
"Ds3fsrs",
|
@@ -139,18 +141,23 @@
|
|
139 |
"LSQR",
|
140 |
"LT",
|
141 |
"M",
|
142 |
-
"Mc",
|
143 |
"Mc-p-d",
|
144 |
"Mc-p-l",
|
|
|
|
|
|
|
145 |
"Mcfp-l",
|
146 |
"Mcfp-ln",
|
147 |
"Mcfprln",
|
148 |
"Mcfprly",
|
149 |
"Mcfsoln",
|
|
|
150 |
"Mcfsrln",
|
|
|
151 |
"Mcmp-l",
|
152 |
"Mcms-ln",
|
153 |
"Mcmsrl",
|
|
|
154 |
"Mcmsrly",
|
155 |
"Mffprln",
|
156 |
"Mffsrln",
|
@@ -223,7 +230,6 @@
|
|
223 |
"Pd3mpr--y",
|
224 |
"Pd3mso",
|
225 |
"Pd3msr",
|
226 |
-
"Pi3",
|
227 |
"Pi3--r",
|
228 |
"Pi3-po",
|
229 |
"Pi3-so",
|
@@ -269,6 +275,7 @@
|
|
269 |
"Pp3-po--------s",
|
270 |
"Pp3-sd--------w",
|
271 |
"Pp3-sd--y-----w",
|
|
|
272 |
"Pp3fpa--------w",
|
273 |
"Pp3fpa--y-----w",
|
274 |
"Pp3fpr--------s",
|
@@ -295,7 +302,6 @@
|
|
295 |
"Ps2fp-s",
|
296 |
"Ps2fsrp",
|
297 |
"Ps2fsrs",
|
298 |
-
"Ps2ms-s",
|
299 |
"Ps3---p",
|
300 |
"Ps3---s",
|
301 |
"Ps3fp-s",
|
@@ -328,7 +334,6 @@
|
|
328 |
"RPAR",
|
329 |
"RSQR",
|
330 |
"Rc",
|
331 |
-
"Rgc",
|
332 |
"Rgp",
|
333 |
"Rgpy",
|
334 |
"Rgs",
|
@@ -386,6 +391,7 @@
|
|
386 |
"Va--3s",
|
387 |
"Va--3s----y",
|
388 |
"Vag",
|
|
|
389 |
"Vaii1",
|
390 |
"Vaii2s",
|
391 |
"Vaii3p",
|
@@ -455,7 +461,7 @@
|
|
455 |
"Vmp--sm",
|
456 |
"Vmp--sm---y",
|
457 |
"Vmsp1p",
|
458 |
-
"
|
459 |
"Vmsp2s",
|
460 |
"Vmsp3",
|
461 |
"Vmsp3-----y",
|
@@ -468,7 +474,9 @@
|
|
468 |
"Ynmsoy",
|
469 |
"Ynmsry",
|
470 |
"Yp",
|
|
|
471 |
"Yp-sr",
|
472 |
"Yr"
|
473 |
-
]
|
|
|
474 |
}
|
|
|
10 |
"Afp",
|
11 |
"Afp-p-n",
|
12 |
"Afp-poy",
|
13 |
+
"Afp-srn",
|
14 |
"Afpf--n",
|
15 |
"Afpfp-n",
|
16 |
"Afpfp-ny",
|
|
|
112 |
"Ds2ms-s",
|
113 |
"Ds3---p",
|
114 |
"Ds3---s",
|
115 |
+
"Ds3---sy",
|
116 |
"Ds3fp-s",
|
117 |
"Ds3fsos",
|
118 |
"Ds3fsrs",
|
|
|
141 |
"LSQR",
|
142 |
"LT",
|
143 |
"M",
|
|
|
144 |
"Mc-p-d",
|
145 |
"Mc-p-l",
|
146 |
+
"Mc-s-b",
|
147 |
+
"Mc-s-d",
|
148 |
+
"Mc-s-l",
|
149 |
"Mcfp-l",
|
150 |
"Mcfp-ln",
|
151 |
"Mcfprln",
|
152 |
"Mcfprly",
|
153 |
"Mcfsoln",
|
154 |
+
"Mcfsrl",
|
155 |
"Mcfsrln",
|
156 |
+
"Mcfsrly",
|
157 |
"Mcmp-l",
|
158 |
"Mcms-ln",
|
159 |
"Mcmsrl",
|
160 |
+
"Mcmsrln",
|
161 |
"Mcmsrly",
|
162 |
"Mffprln",
|
163 |
"Mffsrln",
|
|
|
230 |
"Pd3mpr--y",
|
231 |
"Pd3mso",
|
232 |
"Pd3msr",
|
|
|
233 |
"Pi3--r",
|
234 |
"Pi3-po",
|
235 |
"Pi3-so",
|
|
|
275 |
"Pp3-po--------s",
|
276 |
"Pp3-sd--------w",
|
277 |
"Pp3-sd--y-----w",
|
278 |
+
"Pp3-so--------s",
|
279 |
"Pp3fpa--------w",
|
280 |
"Pp3fpa--y-----w",
|
281 |
"Pp3fpr--------s",
|
|
|
302 |
"Ps2fp-s",
|
303 |
"Ps2fsrp",
|
304 |
"Ps2fsrs",
|
|
|
305 |
"Ps3---p",
|
306 |
"Ps3---s",
|
307 |
"Ps3fp-s",
|
|
|
334 |
"RPAR",
|
335 |
"RSQR",
|
336 |
"Rc",
|
|
|
337 |
"Rgp",
|
338 |
"Rgpy",
|
339 |
"Rgs",
|
|
|
391 |
"Va--3s",
|
392 |
"Va--3s----y",
|
393 |
"Vag",
|
394 |
+
"Vag-------y",
|
395 |
"Vaii1",
|
396 |
"Vaii2s",
|
397 |
"Vaii3p",
|
|
|
461 |
"Vmp--sm",
|
462 |
"Vmp--sm---y",
|
463 |
"Vmsp1p",
|
464 |
+
"Vmsp2p",
|
465 |
"Vmsp2s",
|
466 |
"Vmsp3",
|
467 |
"Vmsp3-----y",
|
|
|
474 |
"Ynmsoy",
|
475 |
"Ynmsry",
|
476 |
"Yp",
|
477 |
+
"Yp,Yn",
|
478 |
"Yp-sr",
|
479 |
"Yr"
|
480 |
+
],
|
481 |
+
"overwrite":false
|
482 |
}
|
tagger/model
CHANGED
Binary files a/tagger/model and b/tagger/model differ
|
|
tok2vec/model
CHANGED
Binary files a/tok2vec/model and b/tok2vec/model differ
|
|
tokenizer
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
-
��prefix_search�
|
2 |
��A�
|
3 |
-
� ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�1-A��A�1-A�1-UL��A�1-UL�1-Ul��A�1-Ul�1-a��A�1-a�1-ul��A�1-ul�10-A��A�10-A�10-LEA��A�10-LEA�10-Lea��A�10-Lea�10-a��A�10-a�10-lea��A�10-lea�11-A��A�11-A�11-LEA��A�11-LEA�11-Lea��A�11-Lea�11-a��A�11-a�11-lea��A�11-lea�12-A��A�12-A�12-LEA��A�12-LEA�12-Lea��A�12-Lea�12-a��A�12-a�12-lea��A�12-lea�2-A��A�2-A�2-LEA��A�2-LEA�2-Lea��A�2-Lea�2-a��A�2-a�2-lea��A�2-lea�3-A��A�3-A�3-LEA��A�3-LEA�3-Lea��A�3-Lea�3-a��A�3-a�3-lea��A�3-lea�4-A��A�4-A�4-LEA��A�4-LEA�4-Lea��A�4-Lea�4-a��A�4-a�4-lea��A�4-lea�5-A��A�5-A�5-LEA��A�5-LEA�5-Lea��A�5-Lea�5-a��A�5-a�5-lea��A�5-lea�6-A��A�6-A�6-LEA��A�6-LEA�6-Lea��A�6-Lea�6-a��A�6-a�6-lea��A�6-lea�7-A��A�7-A�7-LEA��A�7-LEA�7-Lea��A�7-Lea�7-a��A�7-a�7-lea��A�7-lea�8)��A�8)�8-)��A�8-)�8-A��A�8-A�8-D��A�8-D�8-LEA��A�8-LEA�8-Lea��A�8-Lea�8-a��A�8-a�8-lea��A�8-lea�8D��A�8D�9-A��A�9-A�9-LEA��A�9-LEA�9-Lea��A�9-Lea�9-a��A�9-a�9-lea��A�9-lea�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�A.C.��A�A.C.�A.F.��A�A.F.�A.M.��A�A.M.�A.R.��A�A.R.�AL.��A�AL.�ALIN.��A�ALIN.�ART.��A�ART.�AUG.��A�AUG.�Al.��A�Al.�Alin.��A�Alin.�Art.��A�Art.�Aug.��A�Aug.�BD.��A�BD.�Bd.��A�Bd.�C++��A�C++�D-L��A�D-L�D-LUI��A�D-LUI�D-Lui��A�D-Lui�D-NEI��A�D-NEI�D-Nei��A�D-Nei�D-VOASTRA��A�D-VOASTRA�D-VOASTRĂ��A�D-VOASTRĂ�D-Voastra��A�D-Voastra�D-Voastră��A�D-Voastră�D.P.D.V.��A�D.P.D.V.�DEM.��A�DEM.�DPDV��A�DPDV�DR.��A�DR.�DVS.��A�DVS.�Dem.��A�Dem.�Dpdv��A�Dpdv�Dr.��A�Dr.�Dvs.��A�Dvs.�ETC.��A�ETC.�EX.��A�EX.�Etc.��A�Etc.�Ex.��A�Ex.�FIG.��A�FIG.�FR.��A�FR.�Fig.��A�Fig.�Fr.��A�Fr.�GH.��A�GH.�GR.��A�GR.�Gh.��A�Gh.�Gr.��A�Gr.�IAN.��A�IAN.�ING.��A�ING.�INGR.��A�INGR.�INTR-ADEVAR��A�INTR-ADEVAR�INTR-ADEVĂR��A�INTR-ADEVĂR�Ian.��A�Ian.�Ing.��A�Ing.�Ingr.��A�Ingr.�Intr-Adevar��A�Intr-Adevar�Intr-Adevăr��A�Intr-Adevăr�LIT.��A�LIT.�LT.��A�LT.�Lit.��A�Lit.�Lt.��A�Lt.�NR.��A�NR.�Nr.��A�Nr.�O.O��A�O.O�O.o��A�O.o�OBS.��A�OBS.�O_O��A�O_O�O_o��A�O_o�Obs.��A�Obs.�P.A.��A�P.A.�P.M.��A�P.M.�PCT.��A�PCT.�PREP.��A�PREP.�PROF.��A�PROF.�Pct.��A�Pct.�Prep.��A�Prep.�Prof.��A�Prof.�ROM.��A�ROM.�Rom.��A�Rom.�S.A.��A�S.A.�S.A.M.D.��A�S.A.M.D.�SAMD.��A�SAMD.�SF.��A�SF.�ST.��A�ST.�STR.��A�STR.�Samd.��A�Samd.�Sf.��A�Sf.�St.��A�St.�Str.��A�Str.�TEL.��A�TEL.�Tel.��A�Tel.�UNIV.��A�UNIV.�Univ.��A�Univ.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.c.��A�a.c.�a.f.��A�a.f.�a.m.��A�a.m.�a.r.��A�a.r.�al.��A�al.�alin.��A�alin.�art.��A�art.�aug.��A�aug.�b.��A�b.�bd.��A�bd.�c.��A�c.�d-l��A�d-l�d-lui��A�d-lui�d-nei��A�d-nei�d-voastra��A�d-voastra�d-voastră��A�d-voastră�d.��A�d.�d.p.d.v.��A�d.p.d.v.�dem.��A�dem.�dpdv��A�dpdv�dr.��A�dr.�dvs.��A�dvs.�e.��A�e.�etc.��A�etc.�ex.��A�ex.�f.��A�f.�fig.��A�fig.�fr.��A�fr.�g.��A�g.�gh.��A�gh.�gr.��A�gr.�h.��A�h.�i.��A�i.�ian.��A�ian.�ing.��A�ing.�ingr.��A�ingr.�intr-adevar��A�intr-adevar�intr-adevăr��A�intr-adevăr�j.��A�j.�k.��A�k.�l.��A�l.�lit.��A�lit.�lt.��A�lt.�m.��A�m.�n.��A�n.�nr.��A�nr.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�obs.��A�obs.�p.��A�p.�p.a.��A�p.a.�p.m.��A�p.m.�pct.��A�pct.�prep.��A�prep.�prof.��A�prof.�q.��A�q.�r.��A�r.�rom.��A�rom.�s.��A�s.�s.a.��A�s.a.�s.a.m.d.��A�s.a.m.d.�samd.��A�samd.�sf.��A�sf.�st.��A�st.�str.��A�str.�t.��A�t.�tel.��A�tel.�u.��A�u.�univ.��A�univ.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)
|
|
|
1 |
+
��prefix_search�
|
2 |
��A�
|
3 |
+
� ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�1-A��A�1-A�1-UL��A�1-UL�1-Ul��A�1-Ul�1-a��A�1-a�1-ul��A�1-ul�10-A��A�10-A�10-LEA��A�10-LEA�10-Lea��A�10-Lea�10-a��A�10-a�10-lea��A�10-lea�11-A��A�11-A�11-LEA��A�11-LEA�11-Lea��A�11-Lea�11-a��A�11-a�11-lea��A�11-lea�12-A��A�12-A�12-LEA��A�12-LEA�12-Lea��A�12-Lea�12-a��A�12-a�12-lea��A�12-lea�2-A��A�2-A�2-LEA��A�2-LEA�2-Lea��A�2-Lea�2-a��A�2-a�2-lea��A�2-lea�3-A��A�3-A�3-LEA��A�3-LEA�3-Lea��A�3-Lea�3-a��A�3-a�3-lea��A�3-lea�4-A��A�4-A�4-LEA��A�4-LEA�4-Lea��A�4-Lea�4-a��A�4-a�4-lea��A�4-lea�5-A��A�5-A�5-LEA��A�5-LEA�5-Lea��A�5-Lea�5-a��A�5-a�5-lea��A�5-lea�6-A��A�6-A�6-LEA��A�6-LEA�6-Lea��A�6-Lea�6-a��A�6-a�6-lea��A�6-lea�7-A��A�7-A�7-LEA��A�7-LEA�7-Lea��A�7-Lea�7-a��A�7-a�7-lea��A�7-lea�8)��A�8)�8-)��A�8-)�8-A��A�8-A�8-D��A�8-D�8-LEA��A�8-LEA�8-Lea��A�8-Lea�8-a��A�8-a�8-lea��A�8-lea�8D��A�8D�9-A��A�9-A�9-LEA��A�9-LEA�9-Lea��A�9-Lea�9-a��A�9-a�9-lea��A�9-lea�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�A.C.��A�A.C.�A.F.��A�A.F.�A.M.��A�A.M.�A.R.��A�A.R.�AL.��A�AL.�ALIN.��A�ALIN.�ART.��A�ART.�AUG.��A�AUG.�Al.��A�Al.�Alin.��A�Alin.�Art.��A�Art.�Aug.��A�Aug.�BD.��A�BD.�Bd.��A�Bd.�C++��A�C++�D-L��A�D-L�D-LUI��A�D-LUI�D-Lui��A�D-Lui�D-NEI��A�D-NEI�D-Nei��A�D-Nei�D-VOASTRA��A�D-VOASTRA�D-VOASTRĂ��A�D-VOASTRĂ�D-Voastra��A�D-Voastra�D-Voastră��A�D-Voastră�D.P.D.V.��A�D.P.D.V.�DEM.��A�DEM.�DPDV��A�DPDV�DR.��A�DR.�DVS.��A�DVS.�Dem.��A�Dem.�Dpdv��A�Dpdv�Dr.��A�Dr.�Dvs.��A�Dvs.�ETC.��A�ETC.�EX.��A�EX.�Etc.��A�Etc.�Ex.��A�Ex.�FIG.��A�FIG.�FR.��A�FR.�Fig.��A�Fig.�Fr.��A�Fr.�GH.��A�GH.�GR.��A�GR.�Gh.��A�Gh.�Gr.��A�Gr.�IAN.��A�IAN.�ING.��A�ING.�INGR.��A�INGR.�INTR-ADEVAR��A�INTR-ADEVAR�INTR-ADEVĂR��A�INTR-ADEVĂR�Ian.��A�Ian.�Ing.��A�Ing.�Ingr.��A�Ingr.�Intr-Adevar��A�Intr-Adevar�Intr-Adevăr��A�Intr-Adevăr�LIT.��A�LIT.�LT.��A�LT.�Lit.��A�Lit.�Lt.��A�Lt.�NR.��A�NR.�Nr.��A�Nr.�O.O��A�O.O�O.o��A�O.o�OBS.��A�OBS.�O_O��A�O_O�O_o��A�O_o�Obs.��A�Obs.�P.A.��A�P.A.�P.M.��A�P.M.�PCT.��A�PCT.�PREP.��A�PREP.�PROF.��A�PROF.�Pct.��A�Pct.�Prep.��A�Prep.�Prof.��A�Prof.�ROM.��A�ROM.�Rom.��A�Rom.�S.A.��A�S.A.�S.A.M.D.��A�S.A.M.D.�SAMD.��A�SAMD.�SF.��A�SF.�ST.��A�ST.�STR.��A�STR.�Samd.��A�Samd.�Sf.��A�Sf.�St.��A�St.�Str.��A�Str.�TEL.��A�TEL.�Tel.��A�Tel.�UNIV.��A�UNIV.�Univ.��A�Univ.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.c.��A�a.c.�a.f.��A�a.f.�a.m.��A�a.m.�a.r.��A�a.r.�al.��A�al.�alin.��A�alin.�art.��A�art.�aug.��A�aug.�b.��A�b.�bd.��A�bd.�c.��A�c.�d-l��A�d-l�d-lui��A�d-lui�d-nei��A�d-nei�d-voastra��A�d-voastra�d-voastră��A�d-voastră�d.��A�d.�d.p.d.v.��A�d.p.d.v.�dem.��A�dem.�dpdv��A�dpdv�dr.��A�dr.�dvs.��A�dvs.�e.��A�e.�etc.��A�etc.�ex.��A�ex.�f.��A�f.�fig.��A�fig.�fr.��A�fr.�g.��A�g.�gh.��A�gh.�gr.��A�gr.�h.��A�h.�i.��A�i.�ian.��A�ian.�ing.��A�ing.�ingr.��A�ingr.�intr-adevar��A�intr-adevar�intr-adevăr��A�intr-adevăr�j.��A�j.�k.��A�k.�l.��A�l.�lit.��A�lit.�lt.��A�lt.�m.��A�m.�n.��A�n.�nr.��A�nr.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�obs.��A�obs.�p.��A�p.�p.a.��A�p.a.�p.m.��A�p.m.�pct.��A�pct.�prep.��A�prep.�prof.��A�prof.�q.��A�q.�r.��A�r.�rom.��A�rom.�s.��A�s.�s.a.��A�s.a.�s.a.m.d.��A�s.a.m.d.�samd.��A�samd.�sf.��A�sf.�st.��A�st.�str.��A�str.�t.��A�t.�tel.��A�tel.�u.��A�u.�univ.��A�univ.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�°C.��A�°�A�C�A�.�°F.��A�°�A�F�A�.�°K.��A�°�A�K�A�.�°c.��A�°�A�c�A�.�°f.��A�°�A�f�A�.�°k.��A�°�A�k�A�.�ÎNGR.��A�ÎNGR.�ÎNTR-ADEVAR��A�ÎNTR-ADEVAR�ÎNTR-ADEVĂR��A�ÎNTR-ADEVĂR�Îngr.��A�Îngr.�Într-Adevar��A�Într-Adevar�Într-Adevăr��A�Într-Adevăr�ä.��A�ä.�îngr.��A�îngr.�într-adevar��A�într-adevar�într-adevăr��A�într-adevăr�ö.��A�ö.�ü.��A�ü.�Ş.A.��A�Ş.A.�Ş.A.M.D.��A�Ş.A.M.D.�ŞAMD.��A�ŞAMD.�ŞT.��A�ŞT.�Şamd.��A�Şamd.�Şt.��A�Şt.�ş.a.��A�ş.a.�ş.a.m.d.��A�ş.a.m.d.�şamd.��A�şamd.�şt.��A�şt.�Ș.A.��A�Ș.A.�Ș.A.M.D.��A�Ș.A.M.D.�ȘAMD.��A�ȘAMD.�ȘT.��A�ȘT.�Șamd.��A�Șamd.�Șt.��A�Șt.�ș.a.��A�ș.a.�ș.a.m.d.��A�ș.a.m.d.�șamd.��A�șamd.�șt.��A�șt.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
|
vocab/strings.json
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:eb4b6f8ed432c0240717aaf5b7b61d5e5825362699b3ae4a543fbbd012a9e0e8
|
3 |
+
size 1099352
|
vocab/vectors.cfg
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"mode":"default"
|
3 |
+
}
|