BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks Paper • 2412.04626 • Published Dec 5, 2024 • 13
The BrowserGym Ecosystem for Web Agent Research Paper • 2412.05467 • Published Dec 6, 2024 • 19
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 58
The Impact of Positional Encoding on Length Generalization in Transformers Paper • 2305.19466 • Published May 31, 2023 • 2
VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment Paper • 2410.01679 • Published Oct 2, 2024 • 24
Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models Paper • 2305.14775 • Published May 24, 2023
BM25S: Orders of magnitude faster lexical search via eager sparse scoring Paper • 2407.03618 • Published Jul 4, 2024 • 11
Improving Automatic VQA Evaluation Using Large Language Models Paper • 2310.02567 • Published Oct 4, 2023 • 3
Learning Action and Reasoning-Centric Image Editing from Videos and Simulations Paper • 2407.03471 • Published Jul 3, 2024 • 29
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17, 2024 • 51
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild Paper • 2406.04770 • Published Jun 7, 2024 • 28
Are NLP Models really able to Solve Simple Math Word Problems? Paper • 2103.07191 • Published Mar 12, 2021 • 1
Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions Paper • 2310.03016 • Published Oct 4, 2023 • 2