LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention Paper • 2303.16199 • Published Mar 28, 2023 • 4
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models Paper • 2304.09842 • Published Apr 19, 2023 • 1
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts Paper • 2310.02255 • Published Oct 3, 2023 • 2
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model Paper • 2304.15010 • Published Apr 28, 2023 • 4
Multimodal Procedural Planning via Dual Text-Image Prompting Paper • 2305.01795 • Published May 2, 2023 • 1
TheoremQA: A Theorem-driven Question Answering dataset Paper • 2305.12524 • Published May 21, 2023 • 1
Model Editing Can Hurt General Abilities of Large Language Models Paper • 2401.04700 • Published Jan 9, 2024 • 3
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models Paper • 2307.10635 • Published Jul 20, 2023 • 9
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering Paper • 2209.09513 • Published Sep 20, 2022 • 1
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning Paper • 2209.14610 • Published Sep 29, 2022
Enhancing Large Vision Language Models with Self-Training on Image Comprehension Paper • 2405.19716 • Published May 30, 2024
UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression Paper • 2212.02746 • Published Dec 6, 2022
Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning Paper • 2105.04165 • Published May 10, 2021
VDebugger: Harnessing Execution Feedback for Debugging Visual Programs Paper • 2406.13444 • Published Jun 19, 2024
MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models Paper • 2410.08182 • Published Oct 10, 2024
VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning Paper • 2412.02172 • Published Dec 3, 2024
OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning Paper • 2502.11271 • Published 16 days ago • 16
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails Paper • 2502.05163 • Published 25 days ago • 22