arxiv:2503.02682

MPO: Boosting LLM Agents with Meta Plan Optimization

Published on Mar 4

· Submitted by

xwm on Mar 5

#2 Paper of the day

Upvote

Authors:

Weimin Xiong ,

Qingxiu Dong ,

Bingchan Zhao ,

Feifan Song ,

Sujian Li

Abstract

Recent advancements in large language models (LLMs) have enabled LLM-based agents to successfully tackle interactive planning tasks. However, despite their successes, existing approaches often suffer from planning hallucinations and require retraining for each new agent. To address these challenges, we propose the Meta Plan Optimization (MPO) framework, which enhances agent planning capabilities by directly incorporating explicit guidance. Unlike previous methods that rely on complex knowledge, which either require significant human effort or lack quality assurance, MPO leverages high-level general guidance through meta plans to assist agent planning and enables continuous optimization of the meta plans based on feedback from the agent's task execution. Our experiments conducted on two representative tasks demonstrate that MPO significantly outperforms existing baselines. Moreover, our analysis indicates that MPO provides a plug-and-play solution that enhances both task completion efficiency and generalization capabilities in previous unseen scenarios.

View arXiv page View PDF GitHub repository Add to collection

Community

xwm

Paper author Paper submitter 2 days ago

The implementation is open-sourced at https://github.com/WeiminXiong/MPO.
The keypoints we want to make for MPO:

Introduce explicit guidance to steer the agent's planning process, with the ability to refine the guidance based on agent feedback.
Serve as a plug-and-play module that significantly enhances the performance of any agent on tasks without requiring retraining.
Effectively improve the agent's generalization in out-of-distribution tests while boosting task completion efficiency.