danielsteinigen
commited on
Commit
•
57dde9d
1
Parent(s):
328a68d
Update README.md
Browse files
README.md
CHANGED
@@ -35,19 +35,20 @@ It processes German tax laws as input and outputs the extracted key figures with
|
|
35 |
|
36 |
### Model Description
|
37 |
|
38 |
-
This is a fine-tuned token classification model, based on XLM-RoBERTa-Large, for the extraction of key figures and their logical connected properties from tax legal texts. The entity- and relation extraction tasks are trained in a combined model using initial trigger token to distinguish between the tasks. For relation extraction additional tokens are used to mark the extracted entities and predict the relations between them.
|
39 |
|
40 |
|
41 |
- **Model type:** fine-tuned token classification model, based on XLM-RoBERTa-Large
|
42 |
- **Language(s) (NLP):** German
|
43 |
|
44 |
-
### Model Sources
|
45 |
|
46 |
<!-- Provide the basic links for the model. -->
|
47 |
|
48 |
- **Repository:** https://github.com/danielsteinigen/nlp-legal-texts
|
49 |
- **Paper:** https://ceur-ws.org/Vol-3441/paper7.pdf
|
50 |
- **Demo:** https://huggingface.co/spaces/danielsteinigen/NLP-Legal-Texts
|
|
|
51 |
|
52 |
## Uses
|
53 |
```python
|
@@ -87,14 +88,14 @@ Training details can be found in our paper: [https://ceur-ws.org/Vol-3441/paper7
|
|
87 |
|
88 |
### Training Data
|
89 |
|
90 |
-
The model is trained on our dataset __*KeyFiTax*__, which is published here:[https://huggingface.co/datasets/danielsteinigen/KeyFiTax](https://huggingface.co/datasets/danielsteinigen/KeyFiTax)
|
91 |
|
92 |
## Evaluation
|
93 |
|
94 |
Evaluation details can be found in our paper: [https://ceur-ws.org/Vol-3441/paper7.pdf](https://ceur-ws.org/Vol-3441/paper7.pdf)
|
95 |
|
96 |
|
97 |
-
## Citation
|
98 |
|
99 |
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
|
100 |
|
@@ -115,7 +116,7 @@ Evaluation details can be found in our paper: [https://ceur-ws.org/Vol-3441/pape
|
|
115 |
|
116 |
**APA:**
|
117 |
|
118 |
-
Steinigen, D., Namysl, M., Hepperle, M., Krekeler, J., & Landgraf, S. (2023). Semantic Extraction of Key Figures and Their Properties From Tax Legal Texts Using Neural Models.
|
119 |
-
Proceedings of Sixth Workshop on Automated Semantic Analysis of Information in Legal Text, Braga, Portugal, June 23, 2023
|
120 |
CEUR-WS.org, online CEUR-WS.org/Vol-3441/paper7.pdf.
|
121 |
|
|
|
35 |
|
36 |
### Model Description
|
37 |
|
38 |
+
This is a fine-tuned token classification model, based on XLM-RoBERTa-Large, for the extraction of key figures and their logical connected properties from german tax legal texts. The entity- and relation extraction tasks are trained in a combined model using initial trigger token to distinguish between the tasks. For relation extraction additional tokens are used to mark the extracted entities and predict the relations between them.
|
39 |
|
40 |
|
41 |
- **Model type:** fine-tuned token classification model, based on XLM-RoBERTa-Large
|
42 |
- **Language(s) (NLP):** German
|
43 |
|
44 |
+
### Model Sources
|
45 |
|
46 |
<!-- Provide the basic links for the model. -->
|
47 |
|
48 |
- **Repository:** https://github.com/danielsteinigen/nlp-legal-texts
|
49 |
- **Paper:** https://ceur-ws.org/Vol-3441/paper7.pdf
|
50 |
- **Demo:** https://huggingface.co/spaces/danielsteinigen/NLP-Legal-Texts
|
51 |
+
- **Data:** https://huggingface.co/datasets/danielsteinigen/KeyFiTax
|
52 |
|
53 |
## Uses
|
54 |
```python
|
|
|
88 |
|
89 |
### Training Data
|
90 |
|
91 |
+
The model is trained on our dataset __*KeyFiTax*__, which is published here: [https://huggingface.co/datasets/danielsteinigen/KeyFiTax](https://huggingface.co/datasets/danielsteinigen/KeyFiTax)
|
92 |
|
93 |
## Evaluation
|
94 |
|
95 |
Evaluation details can be found in our paper: [https://ceur-ws.org/Vol-3441/paper7.pdf](https://ceur-ws.org/Vol-3441/paper7.pdf)
|
96 |
|
97 |
|
98 |
+
## Citation
|
99 |
|
100 |
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
|
101 |
|
|
|
116 |
|
117 |
**APA:**
|
118 |
|
119 |
+
Steinigen, D., Namysl, M., Hepperle, M., Krekeler, J., & Landgraf, S. (2023). Semantic Extraction of Key Figures and Their Properties From Tax Legal Texts Using Neural Models.
|
120 |
+
Proceedings of Sixth Workshop on Automated Semantic Analysis of Information in Legal Text, Braga, Portugal, June 23, 2023.
|
121 |
CEUR-WS.org, online CEUR-WS.org/Vol-3441/paper7.pdf.
|
122 |
|