PlotEdit: Natural Language-Driven Accessible Chart Editing in PDFs via Multimodal LLM Agents Paper • 2501.11233 • Published 15 days ago
Personalized Graph-Based Retrieval for Large Language Models Paper • 2501.02157 • Published about 1 month ago • 28
LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models Paper • 2501.00874 • Published Jan 1 • 13
VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation Paper • 2412.10704 • Published Dec 14, 2024 • 15
Personalized Multimodal Large Language Models: A Survey Paper • 2412.02142 • Published Dec 3, 2024 • 13
LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding Paper • 2411.01106 • Published Nov 2, 2024 • 4
DynaSaur: Large Language Agents Beyond Predefined Actions Paper • 2411.01747 • Published Nov 4, 2024 • 26
Taipan: Efficient and Expressive State Space Language Models with Selective Attention Paper • 2410.18572 • Published Oct 24, 2024 • 17
DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding Paper • 2410.16472 • Published Oct 21, 2024
VipAct: Visual-Perception Enhancement via Specialized VLM Agent Collaboration and Tool-use Paper • 2410.16400 • Published Oct 21, 2024
ULLME: A Unified Framework for Large Language Model Embeddings with Generation-Augmented Learning Paper • 2408.03402 • Published Aug 6, 2024 • 2
KaPQA: Knowledge-Augmented Product Question-Answering Paper • 2407.16073 • Published Jul 22, 2024 • 1