Add description
app.py
CHANGED
@@ -83,10 +83,11 @@ st.title("RAFT Leaderboard")
 st.markdown("""
 RAFT: Real-world Annotated Few-shot Tasks
 
-Large pre-trained language models have shown promise for few-shot learning, completing text-based tasks given only a few task-specific examples. Will models soon solve classification tasks that have so far been reserved for human research assistants? RAFT is a few-shot classification benchmark that tests language models
-
-
-
+Large pre-trained language models have shown promise for few-shot learning, completing text-based tasks given only a few task-specific examples. Will models soon solve classification tasks that have so far been reserved for human research assistants? RAFT is a few-shot classification benchmark that tests language models:
+
+- across multiple domains (lit review, tweets, customer interaction, etc.)
+- on economically valuable classification tasks (someone inherently cares about the task)
+- in a setting that mirrors deployment (50 examples per task, info retrieval allowed, hidden test set)
 
 To submit to RAFT, follow the instruction posted on [this page](https://github.com/oughtinc/raft_submission).
 """)