From RAG to Memory: Non-Parametric Continual Learning for Large Language Models Paper • 2502.14802 • Published 6 days ago • 10
Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents Paper • 2502.11357 • Published 9 days ago • 9
Diversifying Joint Vision-Language Tokenization Learning Paper • 2306.03421 • Published Jun 6, 2023 • 1
A Systematic Investigation of KB-Text Embedding Alignment at Scale Paper • 2106.01586 • Published Jun 3, 2021
Bringing Back the Context: Camera Trap Species Identification as Link Prediction on Multimodal Knowledge Graphs Paper • 2401.00608 • Published Dec 31, 2023 • 1
A Retrieve-and-Read Framework for Knowledge Graph Link Prediction Paper • 2212.09724 • Published Dec 19, 2022 • 1
Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents Paper • 2502.11357 • Published 9 days ago • 9
A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis Paper • 2311.04157 • Published Nov 7, 2023
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI Paper • 2311.16502 • Published Nov 27, 2023 • 35
BIOCLIP: A Vision Foundation Model for the Tree of Life Paper • 2311.18803 • Published Nov 30, 2023 • 1
Bootstrapping a User-Centered Task-Oriented Dialogue System Paper • 2207.05223 • Published Jul 11, 2022
arXivEdits: Understanding the Human Revision Process in Scientific Writing Paper • 2210.15067 • Published Oct 26, 2022
Sparse Autoencoders for Scientifically Rigorous Interpretation of Vision Models Paper • 2502.06755 • Published 16 days ago • 7