PeppoCola commited on
Commit
b12a845
·
1 Parent(s): 1a17051

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -0
README.md CHANGED
@@ -32,6 +32,12 @@ The model is trained on a dataset of labeled issue reports and is designed to pr
32
  | enhancement | 299,287 (41.4%) | 33,290 (41.3%) |
33
  | question | 62,373 (8.6%) | 7,076 (8.8%) |
34
 
 
 
 
 
 
 
35
  ## Metrics
36
 
37
  The model is evaluated using the following metrics:
 
32
  | enhancement | 299,287 (41.4%) | 33,290 (41.3%) |
33
  | question | 62,373 (8.6%) | 7,076 (8.8%) |
34
 
35
+
36
+ ## Data preprocessing
37
+ The data used for training was preprocessed with [ekphrasis](https://github.com/cbaziotis/ekphrasis), adding some regular expressions to remove code, images and URLs.
38
+ Check out our [GitHub](https://github.com/collab-uniba/Issue-Report-Classification-Using-RoBERTa) code for more information about this.
39
+
40
+
41
  ## Metrics
42
 
43
  The model is evaluated using the following metrics: