arxiv:2502.16111

PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving

Published on Feb 22

Abstract

Recent agent frameworks and inference-time algorithms often struggle with complex planning problems due to limitations in verifying generated plans or reasoning and the varying complexity of instances within a single task. Many existing methods for these tasks either perform task-level verification without considering constraints or apply inference-time algorithms without adapting to instance-level complexity. To address these limitations, we propose PlanGEN, a model-agnostic and easily scalable agent framework with three key components: constraint, verification, and selection agents. Specifically, our approach proposes constraint-guided iterative verification to enhance the performance of inference-time algorithms: Best of N, Tree-of-Thought, and REBASE. In the PlanGEN framework, the selection agent optimizes algorithm choice based on instance complexity, ensuring better adaptability to complex planning problems. Experimental results demonstrate significant improvements over the strongest baseline across multiple benchmarks, achieving state-of-the-art results on NATURAL PLAN (~8%↑), OlympiadBench (~4%↑), DocFinQA (~7%↑), and GPQA (~1%↑). Our key finding highlights that constraint-guided iterative verification improves inference-time algorithms, and adaptive selection further boosts performance on complex planning and reasoning problems.
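To make the idea of constraint-guided verification inside Best of N concrete, here is a minimal sketch. It is not the paper's implementation: the `verify` and `best_of_n` functions, the representation of a plan as a list of step strings, and the scoring rule (fraction of constraints satisfied) are all assumptions chosen for illustration. In the actual framework the candidates and the verification would come from LLM agents rather than from hand-written predicates.

```python
from typing import Callable, List, Sequence

# Hypothetical types: a plan is a list of step names, a constraint is a predicate on a plan.
Plan = List[str]
Constraint = Callable[[Plan], bool]


def verify(plan: Plan, constraints: Sequence[Constraint]) -> float:
    """Assumed scoring rule: fraction of constraints the plan satisfies (0.0 to 1.0)."""
    if not constraints:
        return 1.0
    return sum(1 for c in constraints if c(plan)) / len(constraints)


def best_of_n(candidates: Sequence[Plan], constraints: Sequence[Constraint]) -> Plan:
    """Return the candidate with the highest verification score (first one wins ties)."""
    return max(candidates, key=lambda p: verify(p, constraints))


# Toy instance-specific constraints (illustrative only).
constraints = [
    lambda p: len(p) <= 3,
    lambda p: "book venue" in p,
    lambda p: "book venue" in p
    and "send invites" in p
    and p.index("book venue") < p.index("send invites"),
]

# Toy candidate plans standing in for N sampled model outputs.
candidates = [
    ["send invites", "book venue"],
    ["book venue", "send invites"],
    ["book venue", "order food", "hire band", "send invites"],
]

best = best_of_n(candidates, constraints)
print(best)
```

Only the second candidate satisfies all three toy constraints, so it is the one selected. An iterative variant would feed the failed constraints back to the generator and resample, which this sketch leaves out.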


Community

Mihir3009

Paper submitter about 17 hours ago

🚀 Excited to share that our new paper, "PlanGEN", is now out!

We introduce a novel, model-agnostic, and easily scalable/adaptable multi-agent framework utilizing inference-time algorithms designed to generate natural planning and reasoning trajectories to solve complex tasks. By integrating three specialized agents—Constraint, Verification, and Selection—PlanGEN dynamically adapts various inference-time algorithms to effectively solve given problem instances. This means we can:

Extract instance-specific constraints tailored to the planning and reasoning problem,
Evaluate candidate plans with detailed, constraint-driven verification, and
Select the optimal inference-time algorithm based on the task's complexity.
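The first and third steps above can be sketched as a small dispatch routine. Everything here is hypothetical: `extract_constraints` is a keyword-based stand-in for the LLM constraint agent, the constraint count is used as a crude proxy for instance complexity, and the thresholds in `select_algorithm` are arbitrary. The paper's selection agent may use a quite different policy; this only illustrates the shape of "measure complexity, then choose among Best of N, Tree-of-Thought, and REBASE".

```python
from typing import List

# Names of the three inference-time algorithms mentioned in the abstract.
ALGORITHMS = ("best_of_n", "tree_of_thought", "rebase")


def extract_constraints(description: str) -> List[str]:
    """Stand-in constraint agent: keep every sentence that contains the word 'must'."""
    sentences = [s.strip() for s in description.split(".") if s.strip()]
    return [s for s in sentences if "must" in s.lower()]


def select_algorithm(num_constraints: int) -> str:
    """Assumed policy: more constraints means a more search-heavy algorithm.

    The thresholds are illustrative, not taken from the paper.
    """
    if num_constraints <= 1:
        return "best_of_n"
    if num_constraints <= 3:
        return "tree_of_thought"
    return "rebase"


task = (
    "Plan a trip to Rome. "
    "The trip must fit in three days. "
    "The budget must stay under 500 euros."
)
found = extract_constraints(task)
choice = select_algorithm(len(found))
print(found, choice)
```

For this toy task two sentences are picked up as constraints, so the middle option is chosen. A real selection agent would presumably also take verification feedback into account rather than relying on a static count.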

Our experiments show significant performance gains across multiple benchmarks, including NATURAL PLAN, OlympiadBench, DocFinQA, and GPQA.

