sfrenda commited on
Commit
a2ce5dc
1 Parent(s): e0d8377

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -11
README.md CHANGED
@@ -9,32 +9,59 @@ tags:
9
  language:
10
  - it
11
  ---
 
12
 
13
- This model is a fine-tuned version of [AlBERTo](https://huggingface.co/m-polignano-uniba/bert_uncased_L-12_H-768_A-12_italian_alberto) Italian model on the following datasets annotated with the presence of irony:
 
 
 
 
 
 
 
 
 
 
 
 
14
 
15
  - [IronITA 2018](https://live.european-language-grid.eu/catalogue/corpus/7372)
16
  - [Sarcastic Hate Speech dataset](https://github.com/simonasnow/Sarcastic-Hate-Speech)
17
  - SENTIPOLC [2014](https://live.european-language-grid.eu/catalogue/corpus/7480)/[2016](https://live.european-language-grid.eu/catalogue/corpus/7479)
18
 
 
 
 
 
 
 
 
 
 
 
 
 
19
  It was tested on IronITA test set obtaining the following results:
20
 
 
 
21
  - macro F1: 0.79
22
  - accuracy: 0.79
23
  - precision of positive class: 0.77
24
  - recall of positive class: 0.84
25
  - F1 of positive class: 0.80
26
 
27
- Training hyperparameters:
28
-
29
- - learning_rate: 2e-5
30
- - train_batch_size: 16
31
- - eval_batch_size: 16
32
- - seed: 42
33
- - optimizer: Adam
34
-
35
- Framework versions:
36
 
37
  - Transformers 4.30.2
38
  - Pytorch 2.1.2
39
  - Datasets 2.19.0
40
- - Accelerate=0.30.0-
 
 
 
 
 
 
 
 
 
9
  language:
10
  - it
11
  ---
12
+ # Model Details
13
 
14
+ ## Model Description
15
+
16
+ - **Developed by:** [aequa-tech](https://aequa-tech.com/)
17
+ - **Funded by:** [NGI-Search](https://www.ngi.eu/ngi-projects/ngi-search/)
18
+ - **Language(s) (NLP):** Italian
19
+ - **License:** apache-2.0
20
+ - **Finetuned from model:** [AlBERTo](https://huggingface.co/m-polignano-uniba/bert_uncased_L-12_H-768_A-12_italian_alberto)
21
+
22
+ This model is a fine-tuned version of [AlBERTo](https://huggingface.co/m-polignano-uniba/bert_uncased_L-12_H-768_A-12_italian_alberto) Italian model on **irony detection**:
23
+
24
+ # Training Details
25
+
26
+ ## Training Data
27
 
28
  - [IronITA 2018](https://live.european-language-grid.eu/catalogue/corpus/7372)
29
  - [Sarcastic Hate Speech dataset](https://github.com/simonasnow/Sarcastic-Hate-Speech)
30
  - SENTIPOLC [2014](https://live.european-language-grid.eu/catalogue/corpus/7480)/[2016](https://live.european-language-grid.eu/catalogue/corpus/7479)
31
 
32
+ ## Training Hyperparameters
33
+
34
+ - learning_rate: 2e-5
35
+ - train_batch_size: 16
36
+ - eval_batch_size: 16
37
+ - seed: 42
38
+ - optimizer: Adam
39
+
40
+
41
+ # Evaluation
42
+
43
+ ## Testing Data
44
  It was tested on IronITA test set obtaining the following results:
45
 
46
+ ## Metrics and Results
47
+
48
  - macro F1: 0.79
49
  - accuracy: 0.79
50
  - precision of positive class: 0.77
51
  - recall of positive class: 0.84
52
  - F1 of positive class: 0.80
53
 
54
+ # Framework versions
 
 
 
 
 
 
 
 
55
 
56
  - Transformers 4.30.2
57
  - Pytorch 2.1.2
58
  - Datasets 2.19.0
59
+ - Accelerate=0.30.0
60
+
61
+ # How to use this model:
62
+ ```
63
+ model = AutoModelForSequenceClassification.from_pretrained('aequa-tech/irony',num_labels=2)
64
+ tokenizer = AutoTokenizer.from_pretrained("m-polignano-uniba/bert_uncased_L-12_H-768_A-12_italian_alb3rt0")
65
+ classifier = pipeline("text-classification", model=model, tokenizer=tokenizer)
66
+ classifier("")
67
+ ```