Datasets and models for EMNLP paper "Scalable Data Ablation Approximations for Language Models through Modular Training and Merging"
Clara Na
claran
AI & ML interests
None yet
Recent Activity
updated
a dataset
4 days ago
claran/wikitext-2-noheader-sample
published
a dataset
4 days ago
claran/wikitext-2-noheader-sample
updated
a dataset
4 days ago
claran/wikitext-2-nonulls-sample
Organizations
Collections
1
Papers
1
models
30
claran/s2orc-biology1994-1999-ind-130m
Updated
•
3
claran/s2orc-biology2007-2008-ind-130m
Updated
•
3
claran/s2orc-biology2013-2013-ind-130m
Updated
•
2
claran/s2orc-biology2021-2021-ind-130m
Updated
•
2
claran/s2orc-biology2019-2019-ind-130m
Updated
•
7
claran/s2orc-biology2000-2003-ind-130m
Updated
•
1
claran/s2orc-biology2015-2015-ind-130m
Updated
•
5
claran/s2orc-biology2014-2014-ind-130m
Updated
•
14
claran/s2orc-biology2004-2006-ind-130m
Updated
•
5
claran/s2orc-biology2016-2016-ind-130m
Updated
•
3
datasets
13
claran/wikitext-2-noheader-sample
Viewer
•
Updated
•
10k
•
9
claran/wikitext-2-nonulls-sample
Viewer
•
Updated
•
10k
•
100
claran/samsum_sample
Viewer
•
Updated
•
1k
•
67
claran/xsum_sample
Viewer
•
Updated
•
10k
•
33
claran/cnn_dailymail_sample
Viewer
•
Updated
•
10k
•
75
claran/wikitext-2-sample
Viewer
•
Updated
•
10k
•
139
claran/bookcorpus_sample
Viewer
•
Updated
•
10k
•
166
claran/modular-s2orc
Viewer
•
Updated
•
7.47M
•
613
•
3
claran/seed-pretrain-decon
Viewer
•
Updated
•
3.45M
•
96
claran/m2d2-wiki-decon
Viewer
•
Updated
•
5.3M
•
89