denizspynk commited on
Commit
3311278
·
1 Parent(s): fa09cb9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -9
README.md CHANGED
@@ -28,26 +28,26 @@ should probably proofread and complete it, then remove this comment. -->
28
 
29
  # requirements_ambiguity_v2
30
 
31
- This model is a fine-tuned version of [GroNLP/bert-base-dutch-cased](https://huggingface.co/GroNLP/bert-base-dutch-cased) on an unknown dataset.
 
 
 
 
 
32
  It achieves the following results on the evaluation set:
33
  - Loss: 0.7485
34
  - Accuracy: 0.8458
35
  - F1: 0.8442
36
  - Recall: 0.7474
37
 
38
- ## Model description
39
-
40
- More information needed
41
-
42
  ## Intended uses & limitations
43
 
44
- More information needed
45
 
46
  ## Training and evaluation data
47
 
48
- More information needed
49
-
50
- ## Training procedure
51
 
52
  ### Training hyperparameters
53
 
 
28
 
29
  # requirements_ambiguity_v2
30
 
31
+ This model is a fine-tuned version of [GroNLP/bert-base-dutch-cased](https://huggingface.co/GroNLP/bert-base-dutch-cased) on a private dataset with 2,523 labeled software requirements for ambiguity detection.
32
+
33
+ Please contact me via [LinkedIn](https://www.linkedin.com/in/denizayhan/) if you have any questions about this model or the dataset used.
34
+
35
+ The dataset and this model were created as part of the final project assignment of the Natural Language Understanding course (XCS224U) from the Professional AI Program of the Stanford School of Engineering.
36
+
37
  It achieves the following results on the evaluation set:
38
  - Loss: 0.7485
39
  - Accuracy: 0.8458
40
  - F1: 0.8442
41
  - Recall: 0.7474
42
 
 
 
 
 
43
  ## Intended uses & limitations
44
 
45
+ The model performs automated ambiguity detection through binary text classification. Its intended use is as a tool voor requirements engineers to detect spurious and ambiguous formulations.
46
 
47
  ## Training and evaluation data
48
 
49
+ The model was trained on ReqAmbi dataset. This dataset is private and contains 2,523 requirement formulations. Each requirement is manually
50
+ labeled 0 (unambiguous) or 1 (ambiguous). The dataset is split 2,019/253/253 into train, validation and test. The reported metrics are from the evaluation on the test set. The validation set was used for cross-validation during training.
 
51
 
52
  ### Training hyperparameters
53