PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving
Abstract
Recent agent frameworks and inference-time algorithms often struggle with complex planning problems due to limitations in verifying generated plans or reasoning and varying complexity of instances within a single task. Many existing methods for these tasks either perform task-level verification without considering constraints or apply inference-time algorithms without adapting to instance-level complexity. To address these limitations, we propose PlanGEN, a model-agnostic and easily scalable agent framework with three key components: constraint, verification, and selection agents. Specifically, our approach proposes constraint-guided iterative verification to enhance performance of inference-time algorithms--Best of N, Tree-of-Thought, and REBASE. In PlanGEN framework, the selection agent optimizes algorithm choice based on instance complexity, ensuring better adaptability to complex planning problems. Experimental results demonstrate significant improvements over the strongest baseline across multiple benchmarks, achieving state-of-the-art results on NATURAL PLAN (sim8%uparrow), OlympiadBench (sim4%uparrow), DocFinQA (sim7%uparrow), and GPQA (sim1%uparrow). Our key finding highlights that constraint-guided iterative verification improves inference-time algorithms, and adaptive selection further boosts performance on complex planning and reasoning problems.
Community
🚀 Excited to share that our new paper, "PlanGEN", is now out!
We introduce a novel, model-agnostic, and easily scalable/adaptable multi-agent framework utilizing inference-time algorithms designed to generate natural planning and reasoning trajectories to solve complex tasks. By integrating three specialized agents—Constraint, Verification, and Selection—PlanGEN dynamically adapts various inference-time algorithms to effectively solve given problem instances. This means we can:
- Extract instance-specific constraints tailored to the planning and reasoning problem,
- Evaluate candidate plans with detailed, constraint-driven verification, and
- Select the optimal inference-time algorithm based on the task's complexity.
Our experiments show significant performance gains across multiple benchmarks, including NATURAL PLAN, OlympiadBench, DocFinQA, and GPQA.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning (2025)
- AgentBreeder: Mitigating the AI Safety Impact of Multi-Agent Scaffolds (2025)
- SRMT: Shared Memory for Multi-agent Lifelong Pathfinding (2025)
- MACI: Multi-Agent Collaborative Intelligence for Adaptive Reasoning and Temporal Planning (2025)
- Towards Principled Multi-Agent Task Agnostic Exploration (2025)
- LLM-Powered Decentralized Generative Agents with Adaptive Hierarchical Knowledge Graph for Cooperative Planning (2025)
- Cooperative Multi-Agent Planning with Adaptive Skill Synthesis (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper