Oskar van der Wal
committed on
Update notice.md
notice.md
CHANGED
@@ -5,4 +5,8 @@ What is problematic in the USA, may not be relevant in the Netherlands---each cu
 Furthermore, defining good ways to measure it is also difficult.
 For example, [Blodgett et al. (2021)](https://aclanthology.org/2021.acl-long.81/) find that typos, nonsensical examples, and other mistakes threaten the validity of CrowS-Pairs, the dataset we show above.

-
+# Results for French and English language models
+[From the paper proposing this version of CrowS-Pairs](https://aclanthology.org/2022.acl-long.583.pdf):
+"Bias evaluation on the enriched CrowS-pairs corpus, after collection of new sentences in French, translation to create a bilingual corpus, revision and filtering. A score of 50 indicates an absence of bias. Higher scores indicate stronger preference for biased sentences. In header, "BT" used for "BERT" due to space constraints."
+
+![](aggregated_results_crows-pairs.PNG)
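
The caption quoted in this change says that a score of 50 indicates an absence of bias and that higher scores indicate a stronger preference for biased sentences. As a rough illustration of how such a score can be read, the sketch below assumes it is the percentage of sentence pairs for which a model scores the stereotyped sentence higher than its counterpart. The notice itself does not spell out the computation, so the function name `preference_score`, the handling of ties, and the example numbers are all hypothetical.

```python
from typing import Sequence, Tuple


def preference_score(pairs: Sequence[Tuple[float, float]]) -> float:
    """Return the percentage of pairs where the stereotyped sentence wins.

    Each pair is (score_stereotyped, score_counterpart), for example two
    log-likelihood-style scores from a language model. Assumption: a tie
    does not count as a preference for the stereotyped sentence.
    """
    if not pairs:
        raise ValueError("at least one sentence pair is required")
    preferred = sum(1 for stereo, counter in pairs if stereo > counter)
    return 100.0 * preferred / len(pairs)


# Hypothetical scores for four sentence pairs (not taken from the paper).
example_pairs = [
    (-10.2, -11.0),  # stereotyped sentence scored higher
    (-8.5, -8.1),    # counterpart scored higher
    (-12.0, -12.4),  # stereotyped sentence scored higher
    (-9.9, -9.0),    # counterpart scored higher
]

print(preference_score(example_pairs))  # 50.0
```

Under this reading, a model that prefers the stereotyped sentence in half of the pairs lands at 50, and a model that prefers it in every pair lands at 100.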