Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,16 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
# DATASET
|
2 |
MilliyetNER dataset was collected from the Turkish Milliyet newspaper articles between 1997-1998. This dataset is presented by [Tür et al. (2003)](https://www.cambridge.org/core/journals/natural-language-engineering/article/abs/statistical-information-extraction-system-for-turkish/7C288FAFC71D5F0763C1F8CE66464017). It was collected from news articles and manually annotated with three different entity types: Person, Location, Organization. The authors did not provide training/validation/test splits for this dataset. We provide the dataset splits used by [Yeniterzi et al. 2011](https://aclanthology.org/P11-3019).
|
3 |
For more information: [tdd.ai - MilliyetNER](https://data.tdd.ai/#/effafb5f-ebfc-4e5c-9a63-4f709ec1a135)
|
@@ -57,17 +70,4 @@ ORGANIZATION 0.95 0.92 0.94 863
|
|
57 |
micro avg 0.97 0.95 0.96 3233
|
58 |
macro avg 0.96 0.95 0.96 3233
|
59 |
weighted avg 0.97 0.95 0.96 3233
|
60 |
-
```
|
61 |
-
|
62 |
-
---
|
63 |
-
tags:
|
64 |
-
- ner
|
65 |
-
- token-classification
|
66 |
-
- berturk
|
67 |
-
- turkish
|
68 |
-
language: tr
|
69 |
-
datasets:
|
70 |
-
- MilliyetNER
|
71 |
-
widget:
|
72 |
-
- text: "Türkiye'nin başkenti Ankara'dır ve ilk cumhurbaşkanı Mustafa Kemal Atatürk'tür."
|
73 |
-
---
|
|
|
1 |
+
---
|
2 |
+
tags:
|
3 |
+
- ner
|
4 |
+
- token-classification
|
5 |
+
- berturk
|
6 |
+
- turkish
|
7 |
+
language: tr
|
8 |
+
datasets:
|
9 |
+
- MilliyetNER
|
10 |
+
widget:
|
11 |
+
- text: "Türkiye'nin başkenti Ankara'dır ve ilk cumhurbaşkanı Mustafa Kemal Atatürk'tür."
|
12 |
+
---
|
13 |
+
|
14 |
# DATASET
|
15 |
MilliyetNER dataset was collected from the Turkish Milliyet newspaper articles between 1997-1998. This dataset is presented by [Tür et al. (2003)](https://www.cambridge.org/core/journals/natural-language-engineering/article/abs/statistical-information-extraction-system-for-turkish/7C288FAFC71D5F0763C1F8CE66464017). It was collected from news articles and manually annotated with three different entity types: Person, Location, Organization. The authors did not provide training/validation/test splits for this dataset. We provide the dataset splits used by [Yeniterzi et al. 2011](https://aclanthology.org/P11-3019).
|
16 |
For more information: [tdd.ai - MilliyetNER](https://data.tdd.ai/#/effafb5f-ebfc-4e5c-9a63-4f709ec1a135)
|
|
|
70 |
micro avg 0.97 0.95 0.96 3233
|
71 |
macro avg 0.96 0.95 0.96 3233
|
72 |
weighted avg 0.97 0.95 0.96 3233
|
73 |
+
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|