Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task Paper • 1809.08887 • Published Sep 24, 2018
ScisummNet: A Large Annotated Corpus and Content-Impact Models for Scientific Paper Summarization with Citation Networks Paper • 1909.01716 • Published Sep 4, 2019
Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models Paper • 2305.17311 • Published May 27, 2023
WILDS: A Benchmark of in-the-Wild Distribution Shifts Paper • 2012.07421 • Published Dec 14, 2020
LM-Critic: Language Models for Unsupervised Grammatical Error Correction Paper • 2109.06822 • Published Sep 14, 2021
Extending the WILDS Benchmark for Unsupervised Adaptation Paper • 2112.05090 • Published Dec 9, 2021
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models Paper • 2201.05966 • Published Jan 16, 2022
GreaseLM: Graph REASoning Enhanced Language Models for Question Answering Paper • 2201.08860 • Published Jan 21, 2022
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models Paper • 2206.04615 • Published Jun 9, 2022
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text Paper • 2403.18421 • Published Mar 27, 2024
STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases Paper • 2404.13207 • Published Apr 19, 2024
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models Paper • 2405.14831 • Published May 23, 2024
Is ChatGPT a General-Purpose Natural Language Processing Task Solver? Paper • 2302.06476 • Published Feb 8, 2023
Med-EASi: Finely Annotated Dataset and Models for Controllable Simplification of Medical Texts Paper • 2302.09155 • Published Feb 17, 2023