Spaces:
Runtime error
Runtime error
Commit
·
6cd3eaa
1
Parent(s):
e49f3de
Update app.py
Browse files
app.py
CHANGED
@@ -221,11 +221,14 @@ with demo:
|
|
221 |
# gr.Markdown("If a model can reliably report the underspecification of an inference-time task, an AI systems can replace only those task predictions with\
|
222 |
# an appropriate heuristic or information retrieval process.")
|
223 |
gr.Markdown("*Follow the numbered steps below to test one of the pre-loaded options.* Once you get the hang of it, you can load a new model and/or provide your own input texts.")
|
224 |
-
|
225 |
-
|
226 |
-
|
227 |
-
|
228 |
-
|
|
|
|
|
|
|
229 |
|
230 |
|
231 |
|
|
|
221 |
# gr.Markdown("If a model can reliably report the underspecification of an inference-time task, an AI systems can replace only those task predictions with\
|
222 |
# an appropriate heuristic or information retrieval process.")
|
223 |
gr.Markdown("*Follow the numbered steps below to test one of the pre-loaded options.* Once you get the hang of it, you can load a new model and/or provide your own input texts.")
|
224 |
+
gr.Markdown(f"""1) Pick a preloaded BERT-like model.
|
225 |
+
Note: RoBERTa-large performance is best.
|
226 |
+
2) Pick an Occupation type from the Winogender Schemas evaluation set.
|
227 |
+
Or select '{PICK_YOUR_OWN_LABEL}' (it need not be about an occupation).
|
228 |
+
3) Click button to load input texts.
|
229 |
+
Read the sentences to determine which two are well-specified for gendered pronoun coreference resolution. The rest are gender-unspecified.
|
230 |
+
4) Click button to get Task Specification Metric results!
|
231 |
+
"""
|
232 |
|
233 |
|
234 |
|