Organization Card

stacked summaries

This organization exists to test and evaluate the (potential) benefits of "task-oriented pretraining" as popularized by the FLAN-t5 series of models within the summarization NLP task.

mission statement

The idea is to apply a similar concept but adjusted to be more specific w.r.t. the summarization task. Hopefully, this will train models that actually "know" how to condense and distill meaningful information from text rather than learning some naive style transfer of "this is what the dataset summaries sound like so I will do that with essential words."

The most apparent augmentation/task is "stacking" summaries that are shorter than MAX_LENGTH_TOKENS when combined, so the model has to learn to separate and group summaries for these independent concepts.

work statement

unless otherwise noted, work here is completed by "night shift" NLP enthusiasts/researchers in their free time.

models 4

datasets 4

stacked-summaries/onlystacked-xsum-1024

Viewer • Updated Oct 16, 2023 • 222k • 214

stacked-summaries/stacked-xsum-1024

Viewer • Updated Oct 8, 2023 • 357k • 198 • 1

stacked-summaries/stacked-samsum-1024

Viewer • Updated May 29, 2023 • 32.7k • 145 • 5

stacked-summaries/stacked-xsum

Viewer • Updated Jan 12, 2023 • 185k • 169 • 2

Stacked Summaries

AI & ML interests

stacked summaries

mission statement

work statement

models 4

stacked-summaries/flan-t5-large-stacked-xsum-1024

stacked-summaries/flan-t5-large-samsum

stacked-summaries/flan-t5-large-stacked-samsum-1024

stacked-summaries/flan-t5-small-stacked-samsum-1024

datasets 4

stacked-summaries/onlystacked-xsum-1024

stacked-summaries/stacked-xsum-1024

stacked-summaries/stacked-samsum-1024

stacked-summaries/stacked-xsum

AI & ML interests

Team members 3

stacked summaries

mission statement

work statement

models 4 Sort: Recently updated

datasets 4 Sort: Recently updated

models 4

datasets 4