Datasets and models for EMNLP paper "Scalable Data Ablation Approximations for Language Models through Modular Training and Merging"
Clara Na
claran
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 4 hours ago
claran/wmt14-fr-en-sample
published
a dataset
about 4 hours ago
claran/wmt14-fr-en-sample
updated
a dataset
4 days ago
claran/imdb_sample
Organizations
Collections
1
Papers
1
models
30
claran/s2orc-biology1994-1999-ind-130m
Updated
•
3
claran/s2orc-biology2007-2008-ind-130m
Updated
•
3
claran/s2orc-biology2013-2013-ind-130m
Updated
•
3
claran/s2orc-biology2021-2021-ind-130m
Updated
•
2
claran/s2orc-biology2019-2019-ind-130m
Updated
•
8
claran/s2orc-biology2000-2003-ind-130m
Updated
•
1
claran/s2orc-biology2015-2015-ind-130m
Updated
•
3
claran/s2orc-biology2014-2014-ind-130m
Updated
•
14
claran/s2orc-biology2004-2006-ind-130m
Updated
•
4
claran/s2orc-biology2016-2016-ind-130m
Updated
•
4
datasets
15
claran/wmt14-fr-en-sample
Viewer
•
Updated
•
1.02k
claran/imdb_sample
Viewer
•
Updated
•
1.02k
•
15
claran/wikitext-2-noheader-sample
Viewer
•
Updated
•
10k
•
38
claran/wikitext-2-nonulls-sample
Viewer
•
Updated
•
10k
•
291
claran/samsum_sample
Viewer
•
Updated
•
1k
•
77
claran/xsum_sample
Viewer
•
Updated
•
10k
•
46
claran/cnn_dailymail_sample
Viewer
•
Updated
•
10k
•
82
claran/wikitext-2-sample
Viewer
•
Updated
•
10k
•
145
claran/bookcorpus_sample
Viewer
•
Updated
•
10k
•
174
claran/modular-s2orc
Viewer
•
Updated
•
7.47M
•
492
•
3