HiMAP-Travel: Hierarchical Multi-Agent Planning for Long-Horizon Constrained Travel
Abstract
A hierarchical multi-agent framework named HiMAP-Travel addresses long-horizon planning challenges in travel planning by decomposing tasks into strategic coordination and parallel execution, achieving superior performance over sequential and other multi-agent baselines.
Sequential LLM agents fail on long-horizon planning with hard constraints like budgets and diversity requirements. As planning progresses and context grows, these agents drift from global constraints. We propose HiMAP-Travel, a hierarchical multi-agent framework that splits planning into strategic coordination and parallel day-level execution. A Coordinator allocates resources across days, while Day Executors plan independently in parallel. Three key mechanisms enable this: a transactional monitor enforcing budget and uniqueness constraints across parallel agents, a bargaining protocol allowing agents to reject infeasible sub-goals and trigger re-planning, and a single policy trained with GRPO that powers all agents through role conditioning. On TravelPlanner, HiMAP-Travel with Qwen3-8B achieves 52.78% validation and 52.65% test Final Pass Rate (FPR). In a controlled comparison with identical model, training, and tools, it outperforms the sequential DeepTravel baseline by +8.67~pp. It also surpasses ATLAS by +17.65~pp and MTP by +10.0~pp. On FlexTravelBench multi-turn scenarios, it achieves 44.34% (2-turn) and 37.42% (3-turn) FPR while reducing latency 2.5x through parallelization.
Community
This paper introduces HiMAP-Travel, an end-to-end RL framework for hierarchical multi-agent planning. By separating strategic coordination from parallel day-level execution, it mitigates constraint drift in long-horizon LLM agents and improves feasibility and efficiency on travel planning benchmarks.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Beyond Entangled Planning: Task-Decoupled Planning for Long-Horizon Agents (2026)
- HiMAC: Hierarchical Macro-Micro Learning for Long-Horizon LLM Agents (2026)
- Budget-Aware Agentic Routing via Boundary-Guided Training (2026)
- Budget-Constrained Agentic Large Language Models: Intention-Based Planning for Costly Tool Use (2026)
- MagicAgent: Towards Generalized Agent Planning (2026)
- DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints (2026)
- TodoEvolve: Learning to Architect Agent Planning Systems (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper