LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences Paper • 2412.01292 • Published Dec 2, 2024 • 12
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding Paper • 2311.03354 • Published Nov 6, 2023 • 4
Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference Paper • 2306.14393 • Published Jun 26, 2023