Commit 44d5302 (1 parent: a28fe65), committed by lrakotoson
Update README.md

README.md CHANGED
@@ -1,3 +1,15 @@
+---
+language:
+- en
+datasets:
+- xsum
+- scitldr
+widget:
+- text: "We introduce TLDR generation, a new form of extreme summarization, for scientific papers. TLDR generation involves high source compression and requires expert background knowledge and understanding of complex domain-specific language. To facilitate study on this task, we introduce SciTLDR, a new multi-target dataset of 5.4K TLDRs over 3.2K papers. SciTLDR contains both author-written and expert-derived TLDRs, where the latter are collected using a novel annotation protocol that produces high-quality summaries while minimizing annotation burden. We propose CATTS, a simple yet effective learning strategy for generating TLDRs that exploits titles as an auxiliary training signal. CATTS improves upon strong baselines under both automated metrics and human evaluations."
+license: "apache-2.0"
+
+---
+
 # AI2 SciTLDR
 Fairseq checkpoints from CATTS XSUM to Transformers BART (Abstract Only)
 