Publications

View my profile on google scholar.

View my profile on dblp.

View my profile on orcid.

View my profile on semantic scholar.

View my profile on researchgate.

Dissertation

Target Assignment and Path Planning for Navigation Tasks with Teams of Agents.
Hang Ma.
PhD Thesis, University of Southern California, 2020.
[ pdf] [ bibtex] [ award talk]
ICAPS 2021 Best Dissertation Award
IFAAMAS 2020 Victor Lesser Distinguished Dissertation Award Runner-Up
```
@phdthesis{MaThesis20,
 author = {Hang Ma},
 school = {University of Southern California},
 title = {Target Assignment and Path Planning for Navigation Tasks with Teams of Agents},
 year = {2020}
}
```

2025

Space-Time Graphs of Convex Sets for Multi-Robot Motion Planning.
Jingtao Tang, Zining Mao, Lufan Yang and Hang Ma.
IEEE/RSJ International Conference on Intelligent Robots and System (IROS), Hangzhou, China, October 2025.
[ pdf] [ abstract] [ bibtex] [ project] [ github]
Best Paper Award at the RSS 2025 Workshop on Scalable and Resilient Multi-Robot Systems
```
@inproceedings{TangIROS25,
 author = {Jingtao Tang and Zining Mao and Lufan Yang and Hang Ma},
 booktitle = {{IEEE/RSJ} International Conference on Intelligent Robots and System},
 pages = {(in print)},
 title = {Space-Time Graphs of Convex Sets for Multi-Robot Motion Planning},
 year = {2025}
}
```
Reevaluation of Large Neighborhood Search for MAPF: Findings and Opportunities.
Jiaqi Tan*, Yudong Luo*, Jiaoyang Li and Hang Ma. (* Equal Contribution)
International Symposium on Combinatorial Search (SoCS), Glasgow, United Kingdom, August 2025.
[ pdf] [ abstract] [ bibtex] [ github] (Position Paper)
Multi-Agent Path Finding (MAPF) aims to arrange collision-free goal-reaching paths for a group of agents. Anytime MAPF solvers based on large neighborhood search (LNS) have gained prominence recently due to their flexibility and scalability, leading to a surge of methods, especially those leveraging machine learning, to enhance neighborhood selection. However, several pitfalls exist and hinder a comprehensive evaluation of these new methods, which mainly include: 1) Lower than actual or incorrect baseline performance; 2) Lack of a unified evaluation setting and criterion; 3) Lack of a codebase or executable model for supervised learning methods. To address these challenges, we introduce a unified evaluation framework, implement prior methods, and conduct an extensive comparison of prominent methods. Our evaluation reveals that rule-based heuristics serve as strong baselines, while current learning-based methods show no clear advantage on time efficiency or improvement capacity. Our extensive analysis also opens up new research opportunities for improving MAPF-LNS, such as targeting high-delayed agents, applying contextual algorithms, optimizing replan order and neighborhood size, where machine learning can potentially be integrated.
```
@inproceedings{TanSOCS25,
 author = {Jiaqi Tan and Yudong Luo and Jiaoyang Li and Hang Ma},
 booktitle = {International Symposium on Combinatorial Search},
 pages = {(in print)},
 title = {Reevaluation of Large Neighborhood Search for MAPF: Findings and Opportunities},
 year = {2025}
}
```
Large-Scale Multirobot Coverage Path Planning on Grids With Path Deconfliction.
Jingtao Tang, Zining Mao and Hang Ma.
IEEE Transactions on Robotics (T-RO) 41, May 2025.
[ pdf] [ abstract] [ bibtex] [ publisher] [ project] [ github]
We study Multi-Robot Coverage Path Planning (MCPP) on a 4-neighbor 2D grid $G$, which aims to compute paths for multiple robots to cover all cells of $G$. Traditional approaches are limited as they first compute coverage trees on a quadrant coarsened grid $\mathcalH$ and then employ the Spanning Tree Coverage (STC) paradigm to generate paths on $G$, making them inapplicable to grids with partially obstructed $2 \times 2$ blocks. To address this limitation, we reformulate the problem directly on $G$, revolutionizing grid-based MCPP solving and establishing new NP-hardness results. We introduce Extended-STC (ESTC), a novel paradigm that extends STC to ensure complete coverage with bounded suboptimality, even when $\mathcalH$ includes partially obstructed blocks. Furthermore, we present LS-MCPP, a new algorithmic framework that integrates ESTC with three novel types of neighborhood operators within a local search strategy to optimize coverage paths directly on $G$. Unlike prior grid-based MCPP work, our approach also incorporates a versatile post-processing procedure that applies Multi-Agent Path Finding (MAPF) techniques to MCPP for the first time, enabling a fusion of these two important fields in multi-robot coordination. This procedure effectively resolves inter-robot conflicts and accommodates turning costs by solving a MAPF variant, making our MCPP solutions more practical for real-world applications. Extensive experiments demonstrate that our approach significantly improves solution quality and efficiency, managing up to 100 robots on grids as large as $256 \times 256$ within minutes of runtime. Validation with physical robots confirms the feasibility of our solutions under real-world conditions. A project page with code, demo videos, and additional resources is available at: https://sites.google.com/view/lsmcpp.
```
@article{TangTRO25,
 author = {Jingtao Tang and Zining Mao and Hang Ma},
 journal = {IEEE Transactions on Robotics},
 pages = {3348--3367},
 title = {Large-Scale Multirobot Coverage Path Planning on Grids With Path Deconfliction},
 volume = {41},
 year = {2025}
}
```

2024

MFC-EQ: Mean-Field Control with Envelope Q-Learning for Moving Decentralized Agents in Formation.
Qiushi Lin and Hang Ma.
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Abu Dhabi, United Arab Emirates, October 2024.
[ pdf] [ abstract] [ bibtex] [ github]
We study a decentralized version of Moving Agents in Formation (MAiF), a variant of Multi-Agent Path Finding aiming to plan collision-free paths for multiple agents with the dual objectives of reaching their goals quickly while maintaining a desired formation. The agents must balance these objectives under conditions of partial observation and limited communication. The formation maintenance depends on the joint state of all agents, whose dimensionality increases exponentially with the number of agents, rendering the learning process intractable. Additionally, learning a single policy that can accommodate different linear preferences for these two objectives presents a significant challenge. In this paper, we propose Mean-Field Control with Envelop Q-learning (MFC-EQ), a scalable and adaptable learning framework for this bi-objective multi-agent problem. We approximate the dynamics of all agents using mean-field theory while learning a universal preference-agnostic policy through envelop Q-learning. Our empirical evaluation of MFC-EQ across numerous instances shows that it outperforms state-of-the-art centralized MAiF baselines. Furthermore, MFC-EQ effectively handles more complex scenarios where the desired formation changes dynamically -- a challenge that existing MAiF planners cannot address.
```
@inproceedings{LinIROS24,
 author = {Qiushi Lin and Hang Ma},
 booktitle = {IEEE/RSJ International Conference on Intelligent Robots and Systems},
 pages = {14156--14163},
 title = {MFC-EQ: Mean-Field Control with Envelope Q-Learning for Moving Decentralized Agents in Formation},
 year = {2024}
}
```
MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping.
Jiacheng Chen*, Yuefan Wu*, Jiaqi Tan*, Hang Ma and Yasutaka Furukawa. (* Equal Contribution)
European Conference on Computer Vision (ECCV), Milan, Italy, October 2024.
[ pdf] [ abstract] [ bibtex] [ project] [ github]
This paper presents a vector HD-mapping algorithm that formulates the mapping as a tracking task and uses a history of memory latents to ensure consistent reconstructions over time. Our method, MapTracker, accumulates a sensor stream into memory buffers of two latent representations: 1) Raster latents in the birdâ€™s-eye-view (BEV) space and 2) Vector latents over the road elements (i.e., pedestrian-crossings, lane-dividers, and road-boundaries). The approach borrows the query propagation paradigm from the tracking literature that explicitly associates tracked road elements from the previous frame to the current, while fusing a subset of memory latents selected with distance strides to further enhance temporal consistency. A vector latent is decoded to reconstruct the geometry of a road element. The paper further makes benchmark contributions by 1) Improving processing code for existing datasets to produce consistent ground truth with temporal alignments and 2) Augmenting existing mAP metrics with consistency checks. MapTracker significantly outperforms existing methods on both nuScenes and Agroverse2 datasets by over 8% and 19% on the conventional and the new consistency-aware metrics, respectively. The code will be available on our project page: https://map-tracker.github.io.
```
@inproceedings{ChenECCV24,
 author = {Jiacheng Chen and Yuefan Wu and Jiaqi Tan and Hang Ma and Yasutaka Furukawa},
 booktitle = {European Conference on Computer Vision},
 pages = {90--107},
 title = {MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping},
 year = {2024}
}
```
Multi-Robot Connected Fermat Spiral Coverage.
Jingtao Tang and Hang Ma.
International Conference on Automated Planning and Scheduling (ICAPS), Banff, Canada, June 2024.
[ pdf] [ abstract] [ bibtex] [ github]
We introduce the Multi-Robot Connected Fermat Spiral (MCFS), a novel algorithmic framework for Multi-Robot Coverage Path Planning (MCPP) that adapts Connected Fermat Spiral (CFS) from the computer graphics community to multi-robot coordination for the first time. MCFS uniquely enables the orchestration of multiple robots to generate coverage paths that contour around arbitrarily shaped obstacles, a feature that is notably lacking in traditional methods. Our framework not only enhances area coverage and optimizes task performance, particularly in terms of makespan, for workspaces rich in irregular obstacles but also addresses the challenges of path continuity and curvature critical for non-holonomic robots by generating smooth paths without decomposing the workspace. MCFS solves MCPP by constructing a graph of isolines and transforming MCPP into a combinatorial optimization problem, aiming to minimize the makespan while covering all vertices. Our contributions include developing a unified CFS version for scalable and adaptable MCPP, extending it to MCPP with novel optimization techniques for cost reduction and path continuity and smoothness, and demonstrating through extensive experiments that MCFS outperforms existing MCPP methods in makespan, path curvature, coverage ratio, and overlapping ratio. Our research marks a significant step in MCPP, showcasing the fusion of computer graphics and automated planning principles to advance the capabilities of multi-robot systems in complex environments. Our code is publicly available at https://github.com/reso1/MCFS.
```
@inproceedings{TangICAPS24,
 author = {Jingtao Tang and Hang Ma},
 booktitle = {International Conference on Automated Planning and Scheduling},
 pages = {579--587},
 title = {Multi-Robot Connected Fermat Spiral Coverage},
 year = {2024}
}
```
Large-Scale Multi-Robot Coverage Path Planning via Local Search.
Jingtao Tang and Hang Ma.
AAAI Conference on Artificial Intelligence (AAAI), Vancouver, Canada, Feburary 2024.
[ pdf] [ abstract] [ bibtex] [ video] [ github]
We study graph-based Multi-Robot Coverage Path Planning (MCPP) that aims to compute coverage paths for multiple robots to cover all vertices of a given 2D grid terrain graph $G$. Existing graph-based MCPP algorithms first compute a tree cover on $G$---a forest of multiple trees that cover all vertices---and then employ the Spanning Tree Coverage (STC) paradigm to generate coverage paths on the decomposed graph $D$ of the terrain graph $G$ by circumnavigating the edges of the computed trees, aiming to optimize the makespan (i.e., the maximum coverage path cost among all robots). In this paper, we take a different approach by exploring how to systematically search for good coverage paths directly on $D$. We introduce a new algorithmic framework, called LS-MCPP, which leverages a local search to operate directly on $D$. We propose a novel standalone paradigm, Extended-STC (ESTC), that extends STC to achieve complete coverage for MCPP on any decomposed graphs, even those resulting from incomplete terrain graphs. Furthermore, we demonstrate how to integrate ESTC with three novel types of neighborhood operators into our framework to effectively guide its search process. Our extensive experiments demonstrate the effectiveness of LS-MCPP, consistently improving the initial solution returned by two state-of-the-art baseline algorithms that compute suboptimal tree covers on $G$, with a notable reduction in makespan by up to 35.7% and 30.3%, respectively. Moreover, LS-MCPP consistently matches or surpasses the results of optimal tree cover computation, achieving these outcomes with orders of magnitude faster runtime, thereby showcasing its significant benefits for large-scale real-world coverage tasks.
```
@inproceedings{TangAAAI24,
 author = {Jingtao Tang and Hang Ma},
 booktitle = {{AAAI} Conference on Artificial Intelligence},
 pages = {17567--17574},
 title = {Large-Scale Multi-Robot Coverage Path Planning via Local Search},
 year = {2024}
}
```

2023

Mixed Integer Programming for Time-Optimal Multi-Robot Coverage Path Planning with Efficient Heuristics.
Jingtao Tang and Hang Ma.
IEEE Robotics and Automation Letters (RA-L) 8(10), October 2023.
[ pdf] [ abstract] [ bibtex] [ video] [ publisher] [ github]
We investigate time-optimal Multi-Robot Coverage Path Planning (MCPP) for both unweighted and weighted terrains, which aims to minimize the coverage time, defined as the maximum travel time of all robots. Specifically, we focus on a reduction from MCPP to Min-Max Rooted Tree Cover (MMRTC). For the first time, we propose a Mixed Integer Programming (MIP) model to optimally solve MMRTC, resulting in an MCPP solution with a coverage time that is provably at most four times the optimal. Moreover, we propose two suboptimal yet effective heuristics that reduce the number of variables in the MIP model, thus improving its efficiency for large-scale MCPP instances. We show that both heuristics result in reduced-size MIP models that remain complete (i.e., guaranteed to find a solution if one exists) for all MMRTC instances. Additionally, we explore the use of model optimization warm-startup to further improve the efficiency of both the original MIP model and the reduced-size MIP models. We validate the effectiveness of our MIP-based MCPP planner through experiments that compare it with two state-of-the-art MCPP planners on various instances, demonstrating a reduction in the coverage time by an average of $27.65%$ and $23.24%$ over them, respectively.
```
@article{TangRAL23,
 author = {Jingtao Tang and Hang Ma},
 journal = {IEEE Robotics and Automation Letters},
 number = {10},
 pages = {6491--6498},
 title = {Mixed Integer Programming for Time-Optimal Multi-Robot Coverage Path Planning with Efficient Heuristics},
 volume = {8},
 year = {2023}
}
```
SACHA: Soft Actor-Critic with Heuristic-Based Attention for Partially Observable Multi-Agent Path Finding.
Qiushi Lin and Hang Ma.
IEEE Robotics and Automation Letters (RA-L) 8(8), August 2023.
[ pdf] [ abstract] [ bibtex] [ publisher] [ github]
Multi-Agent Path Finding (MAPF) is a crucial component for many large-scale robotic systems, where agents must plan their collision-free paths to their given goal positions. Recently, multi-agent reinforcement learning has been introduced to solve the partially observable variant of MAPF by learning a decentralized single-agent policy in a centralized fashion based on each agent's partial observation. However, existing learning-based methods are ineffective in achieving complex multi-agent cooperation, especially in congested environments, due to the non-stationarity of this setting. To tackle this challenge, we propose a multi-agent actor-critic method called Soft Actor-Critic with Heuristic-Based Attention (SACHA), which employs novel heuristic-based attention mechanisms for both the actors and critics to encourage cooperation among agents. SACHA learns a neural network for each agent to selectively pay attention to the shortest path heuristic guidance from multiple agents within its field of view, thereby allowing for more scalable learning of cooperation. SACHA also extends the existing multi-agent actor-critic framework by introducing a novel critic centered on each agent to approximate $Q$-values. Compared to existing methods that use a fully observable critic, our agent-centered multi-agent actor-critic method results in more impartial credit assignment and better generalizability of the learned policy to MAPF instances with varying numbers of agents and types of environments. We also implement SACHA(C), which embeds a communication module in the agent's policy network to enable information exchange among agents. We evaluate both SACHA and SACHA(C) on a variety of MAPF instances and demonstrate decent improvements over several state-of-the-art learning-based MAPF methods with respect to success rate and solution quality.
```
@article{LinRAL23,
 author = {Qiushi Lin and Hang Ma},
 journal = {IEEE Robotics and Automation Letters},
 number = {8},
 pages = {2377--3766},
 title = {SACHA: Soft Actor-Critic with Heuristic-Based Attention for Partially Observable Multi-Agent Path Finding},
 volume = {8},
 year = {2023}
}
```
Priority-Based Search for the Virtual Network Embedding Problem.
Yi Zheng, Hang Ma, Sven Koenig, Erik Kline and T. K. Satish Kumar.
International Conference on Automated Planning and Scheduling (ICAPS), Prague, Czech Republic, July 2023.
[ pdf] [ abstract] [ bibtex]
The Virtual Network Embedding (VNE) problem is a constrained optimization problem. It arises in the context of allocating resources on heterogeneous physical networks to provide end-to-end computing services. In this paper, we introduce a new solver, called VNE-PBS, that uses priority-based search (PBS) for solving the VNE problem. VNE-PBS uses a prioritized heuristic search algorithm that explores the space of all possible priority orderings using a systematic depth-first search. The solver is inspired by the success of PBS for the Multi-Agent Path Finding (MAPF) problem and the similarities between the VNE and MAPF problems. We show that VNE-PBS significantly outperforms competing methods on various benchmark instances for both the offline and online versions of the VNE problem.
```
@inproceedings{ZhengICAPS23,
 author = {Yi Zheng and Hang Ma and Sven Koenig and Erik Kline and T. K. Satish Kumar},
 booktitle = {International Conference on Automated Planning and Scheduling},
 pages = {472--480},
 title = {Priority-Based Search for the Virtual Network Embedding Problem},
 year = {2023}
}
```
Double-Deck Multi-Agent Pickup and Delivery: Multi-Robot Rearrangement in Large-Scale Warehouses.
Baiyu Li and Hang Ma.
IEEE Robotics and Automation Letters (RA-L) 8(6), June 2023.
[ pdf] [ abstract] [ bibtex] [ video] [ publisher]
We introduce a new problem formulation, Double-Deck Multi-Agent Pickup and Delivery (DD-MAPD), which models the multi-robot shelf rearrangement problem in automated warehouses. DD-MAPD extends both Multi-Agent Pickup and Delivery (MAPD) and Multi-Agent Path Finding (MAPF) by allowing agents to move beneath shelves or lift and deliver a shelf to an arbitrary location, thereby changing the warehouse layout. We show that solving DD-MAPD is NP-hard. To tackle DD-MAPD, we propose MAPF-DECOMP, an algorithmic framework that decomposes a DD-MAPD instance into a MAPF instance for coordinating shelf trajectories and a subsequent MAPD instance with task dependencies for computing paths for agents. We also present an optimization technique to improve the performance of MAPF-DECOMP and demonstrate how to make MAPF-DECOMP complete for well-formed DD-MAPD instances, a realistic subclass of DD-MAPD instances. Our experimental results demonstrate the efficiency and effectiveness of MAPF-DECOMP, with the ability to compute high-quality solutions for large-scale instances with over one thousand shelves and hundreds of agents in just minutes of runtime.
```
@article{LiRAL23,
 author = {Baiyu Li and Hang Ma},
 journal = {IEEE Robotics and Automation Letters},
 number = {6},
 pages = {3701--3708},
 title = {Double-Deck Multi-Agent Pickup and Delivery: Multi-Robot Rearrangement in Large-Scale Warehouses},
 volume = {8},
 year = {2023}
}
```

2022

Intelligent Planning for Large-Scale Multi-Agent Systems.
Hang Ma.
AI Magazine 43(4), December 2022.
[ pdf] [ abstract] [ bibtex] [ publisher]
This article summarizes the New Faculty Highlights talk with the same title at AAAI 2021. Intelligent agents such as different types of robots will soon become an integral part of our daily lives. In real-world multi-agent systems, the most fundamental challenges are assigning tasks to multiple agents (task-level coordination problems) and planning collision-free paths for the agents to task locations (motion-level coordination problems). This article surveys four directions of our research on using intelligent planning techniques for the above multi-agent coordination problems.
```
@article{MaAIM22,
 author = {Hang Ma},
 journal = {AI Magazine},
 number = {4},
 pages = {376--382},
 title = {Intelligent Planning for Large-Scale Multi-Agent Systems},
 volume = {43},
 year = {2022}
}
```
Multi-Goal Multi-Agent Pickup and Delivery.
Qinghong Xu, Jiaoyang Li, Sven Koenig and Hang Ma.
IEEE/RSJ International Conference on Intelligent Robots and System (IROS), Kyoto, Japan, October 2022.
[ pdf] [ abstract] [ bibtex] [ video]
In this work, we consider the Multi-Agent Pickup-and-Delivery (MAPD) problem, where agents constantly engage with new tasks and need to plan collision-free paths to execute them. To execute a task, an agent needs to visit a pair of goal locations, consisting of a pickup location and a delivery location. We propose two variants of an algorithm that assigns a sequence of tasks to each agent using the anytime algorithm Large Neighborhood Search (LNS) and plans paths using the Multi-Agent Path Finding (MAPF) algorithm Priority-Based Search (PBS). LNS-PBS is complete for well-formed MAPD instances, a realistic subclass of MAPD instances, and empirically more effective than the existing complete MAPD algorithm CENTRAL. LNS-wPBS provides no completeness guarantee but is empirically more efficient and stable than LNS-PBS. It scales to thousands of agents and thousands of tasks in a large warehouse and is empirically more effective than the existing scalable MAPD algorithm HBH+MLA*. LNS-PBS and LNS-wPBS also apply to a more general variant of MAPD, namely the Multi-Goal MAPD (MG-MAPD) problem, where tasks can have different numbers of goal locations.
```
@inproceedings{XuIROS22,
 author = {Qinghong Xu and Jiaoyang Li and Sven Koenig and Hang Ma},
 booktitle = {{IEEE/RSJ} International Conference on Intelligent Robots and System},
 pages = {9964--9971},
 title = {Multi-Goal Multi-Agent Pickup and Delivery},
 year = {2022}
}
```
Graph-Based Multi-Robot Path Finding and Planning.
Hang Ma.
Current Robotics Reports 3(3), June 2022.
[ pdf] [ abstract] [ bibtex] [ publisher]
Purpose of Review Planning collision-free paths for multiple robots is important for real-world multi-robot systems and has been studied as an optimization problem on graphs, called multi-agent path finding (MAPF). This review surveys different categories of classic and state-of-the-art MAPF algorithms and different research attempts to tackle the challenges of generalizing MAPF techniques to real-world scenarios. Recent Findings Solving MAPF problems optimally is computationally challenging. Recent advances have resulted in MAPF algorithms that can compute collision-free paths for hundreds of robots and thousands of navigation tasks in seconds of runtime. Many variants of MAPF have been formalized to adapt MAPF techniques to different real-world requirements, such as considerations of robot kinematics, online optimization for real-time systems, and the integration of task assignment and path planning. Summary Algorithmic techniques for MAPF problems have addressed important aspects of several multi-robot applications, including automated warehouse fulfillment and sortation, automated train scheduling, and navigation of non-holonomic robots and quadcopters. This showcases their potential for real-world applications of large-scale multi-robot systems.
```
@article{MaCRR22,
 author = {Hang Ma},
 journal = {Current Robotics Reports},
 number = {3},
 pages = {77--84},
 title = {Graph-Based Multi-Robot Path Finding and Planning},
 volume = {3},
 year = {2022}
}
```
Optimal and Bounded-Suboptimal Multi-Goal Task Assignment and Path Finding.
Xinyi Zhong, Jiaoyang Li, Sven Koenig and Hang Ma.
IEEE International Conference on Robotics and Automation (ICRA), Philadelphia, PA, May 2022.
[ pdf] [ abstract] [ bibtex] [ video]
We formalize and study the multi-goal task assignment and pathfinding (MG-TAPF) problem from theoretical and algorithmic perspectives. The MG-TAPF problem is to compute an assignment of tasks to agents, where each task consists of a sequence of goal locations, and collision-free paths for the agents that visit all goal locations of their assigned tasks in sequence. Theoretically, we prove that the MG-TAPF problem is NP-hard to solve optimally. We present algorithms that build upon algorithmic techniques for the multi-agent pathfinding problem and solve the MG-TAPF problem optimally and bounded-suboptimally. We experimentally compare these algorithms on a variety of different benchmark domains.
```
@inproceedings{ZhongICRA22,
 author = {Xinyi Zhong and Jiaoyang Li and Sven Koenig and Hang Ma},
 booktitle = {IEEE International Conference on Robotics and Automation},
 pages = {10731--10737},
 title = {Optimal and Bounded-Suboptimal Multi-Goal Task Assignment and Path Finding},
 year = {2022}
}
```

2021

Pairwise Symmetry Reasoning for Multi-Agent Path Finding Search.
Jiaoyang Li, Daniel Harabor, Peter J. Stuckey, Hang Ma, Graeme Gange and Sven Koenig.
Artificial Intelligence (AIJ) 301, December 2021.
[ pdf] [ abstract] [ bibtex] [ publisher] [ github]
Multi-Agent Path Finding (MAPF) is a challenging combinatorial problem that asks us to plan collision-free paths for a team of cooperative agents. In this work, we show that one of the reasons why MAPF is so hard to solve is due to a phenomenon called pairwise symmetry, which occurs when two agents have many different paths to their target locations, all of which appear promising, but every combination of them results in a collision. We identify several classes of pairwise symmetries and show that each one arises commonly in practice and can produce an exponential explosion in the space of possible collision resolutions, leading to unacceptable runtimes for current state-of-the-art (bounded-sub)optimal MAPF algorithms. We propose a variety of reasoning techniques that detect the symmetries efficiently as they arise and resolve them by using specialized constraints to eliminate all permutations of pairwise colliding paths in a single branching step. We implement these ideas in the context of a leading optimal MAPF algorithm CBS and show that the addition of the symmetry reasoning techniques can have a dramatic positive effect on its performance â€” we report a reduction in the number of node expansions by up to four orders of magnitude and an increase in scalability by up to thirty times. These gains allow us to solve to optimality a variety of challenging MAPF instances previously considered out of reach for CBS.
```
@article{LiAIJ21,
 author = {Jiaoyang Li and Daniel Harabor and Peter J. Stuckey and Hang Ma and Graeme Gange and Sven Koenig},
 journal = {Artificial Intelligence},
 pages = {103574},
 title = {Pairwise Symmetry Reasoning for Multi-Agent Path Finding Search},
 volume = {301},
 year = {2021}
}
```
A Competitive Analysis of Online Multi-Agent Path Finding.
Hang Ma.
International Conference on Automated Planning and Scheduling (ICAPS), Guangzhou, China, August 2021.
[ pdf] [ abstract] [ bibtex] [ SoCS-21 talk]
We study online Multi-Agent Path Finding (MAPF), where new agents are constantly revealed over time and all agents must find collision-free paths to their given goal locations. We generalize existing complexity results of (offline) MAPF to online MAPF. We classify online MAPF algorithms into different categories based on (1) controllability (the set of agents that they can plan paths for at each time) and (2) rationality (the quality of paths they plan) and study the relationships between them. We perform a competitive analysis for each category of online MAPF algorithms with respect to commonlyused objective functions. We show that a naive algorithm that routes newly-revealed agents one at a time in sequence achieves a competitive ratio that is asymptotically bounded from both below and above by the number of agents with respect to flowtime and makespan. We then show a counterintuitive result that, if rerouting of previously-revealed agents is not allowed, any rational online MAPF algorithms, including ones that plan optimal paths for all newly-revealed agents, have the same asymptotic competitive ratio as the naive algorithm, even on 2D 4-neighbor grids. We also derive constant lower bounds on the competitive ratio of any rational online MAPF algorithms that allow rerouting. The results thus provide theoretical insights into the effectiveness of using MAPF algorithms in an online setting for the first time.
```
@inproceedings{MaICAPS21,
 author = {Hang Ma},
 booktitle = {International Conference on Automated Planning and Scheduling},
 pages = {234--242},
 title = {A Competitive Analysis of Online Multi-Agent Path Finding},
 year = {2021}
}
```
Scalable Rail Planning and Replanning: Winning the 2020 Flatland Challenge.
Jiaoyang Li, Zhe Chen, Yi Zheng, Shao-Hung Chan, Daniel Harabor, Peter J. Stuckey, Hang Ma and Sven Koenig.
International Conference on Automated Planning and Scheduling (ICAPS), Guangzhou, China, August 2021.
[ pdf] [ abstract] [ bibtex] [ demo] [ NeurIPS-21 talk] [ SoCS-21 talk] [ video] [ github]
ICAPS 2021 People's Choice Best System Demonstration Award
Winner of Both Rounds of the NeurIPS-20 Flatland Challenge (Team An_Old_Driver)
Multi-Agent Path Finding (MAPF) is the combinatorial problem of finding collision-free paths for multiple agents on a graph. This paper describes MAPF-based software for solving train planning and replanning problems on large-scale rail networks under uncertainty. The software recently won the 2020 Flatland Challenge, a NeurIPS competition trying to determine how to efficiently manage dense traffic on rail networks. The software incorporates many state-of-theart MAPF or, in general, optimization technologies, such as prioritized planning, large neighborhood search, safe interval path planning, minimum communication policies, parallel computing, and simulated annealing. It can plan collisionfree paths for thousands of trains within a few minutes and deliver deadlock-free actions in real-time during execution.
```
@inproceedings{LiICAPS21,
 author = {Jiaoyang Li and Zhe Chen and Yi Zheng and Shao-Hung Chan and Daniel Harabor and Peter J. Stuckey and Hang Ma and Sven Koenig},
 booktitle = {International Conference on Automated Planning and Scheduling},
 pages = {477--485},
 title = {Scalable Rail Planning and Replanning: Winning the 2020 Flatland Challenge},
 year = {2021}
}
```
Distributed Heuristic Multi-Agent Path Finding with Communication.
Ziyuan Ma*, Yudong Luo* and Hang Ma. (* Equal Contribution)
IEEE International Conference on Robotics and Automation (ICRA), Xi'an, China, May 2021.
[ pdf] [ abstract] [ bibtex] [ video] [ github]
Multi-Agent Path Finding (MAPF) is essential to large-scale robotic systems. Recent methods have applied reinforcement learning (RL) to learn decentralized polices in partially observable environments. A fundamental challenge of obtaining collision-free policy is that agents need to learn cooperation to handle congested situations. This paper combines communication with deep Q-learning to provide a novel learning based method for MAPF, where agents achieve cooperation via graph convolution. To guide RL algorithm on long-horizon goal-oriented tasks, we embed the potential choices of shortest paths from single source as heuristic guidance instead of using a specific path as in most existing works. Our method treats each agent independently and trains the model from a single agentâ€™s perspective. The final trained policy is applied to each agent for decentralized execution. The whole system is distributed during training and is trained under a curriculum learning strategy. Empirical evaluation in obstacle-rich environment indicates the high success rate with low average step of our method.
```
@inproceedings{MaICRA21,
 author = {Ziyuan Ma and Yudong Luo and Hang Ma},
 booktitle = {IEEE International Conference on Robotics and Automation},
 pages = {8699--8705},
 title = {Distributed Heuristic Multi-Agent Path Finding with Communication},
 year = {2021}
}
```

2020

New Techniques for Pairwise Symmetry Breaking in Multi-Agent Path Finding.
Jiaoyang Li, Graeme Gange, Daniel Harabor, Peter J. Stuckey, Hang Ma and Sven Koenig.
International Conference on Automated Planning and Scheduling (ICAPS), Nancy, France, June 2020.
[ pdf] [ abstract] [ bibtex]
We consider two new classes of pairwise path symmetries which appear in the context of Multi-Agent Path Finding (MAPF). The first of them, corridor symmetry, arises when two agents attempt to pass through the same narrow passage in opposite directions. The second, target symmetry, arises when the shortest path of one agent passes through the target location of a second agent after the second agent has already arrived at it. These symmetries can produce an exponential explosion in the space of possible collision resolutions, leading to unacceptable runtimes even for state-of-the-art MAPF algorithms such as Conflict-Based Search (CBS). We propose to break these symmetries using new reasoning techniques that: (1) detect each class of symmetry and (2) resolve them by introducing specialized constraints. We experimentally show that our techniques can, in some cases, more than double the success rate of CBS and improve its runtime by one order of magnitude.
```
@inproceedings{LiICAPS20,
 author = {Jiaoyang Li and Graeme Gange and Daniel Harabor and Peter J. Stuckey and Hang Ma and Sven Koenig},
 booktitle = {International Conference on Automated Planning and Scheduling},
 pages = {193--201},
 title = {New Techniques for Pairwise Symmetry Breaking in Multi-Agent Path Finding},
 year = {2020}
}
```
Moving Agents in Formation in Congested Environments.
Jiaoyang Li, Kexuan Sun, Hang Ma, Ariel Felner, T. K. Satish Kumar and Sven Koenig.
International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Auckland, New Zealand, May 2020.
[ pdf] [ abstract] [ bibtex] [ video]
In this paper, we formalize and study the Moving Agents in Formation (MAiF) problem, that combines the tasks of finding short collision-free paths for multiple agents and keeping them in close adherence to a desired formation. Previous work includes controller-based algorithms, swarm-based algorithms, and potential-field-based algorithms. They usually focus on only one or the other of these tasks, solve the problem greedily without systematic search, and thus generate costly solutions or even fail to find solutions in congested environments. In this paper, we develop a two-phase search algorithm, called SWARM-MAPF, whose first phase is inspired by swarm-based algorithms (in open regions) and whose second phase is inspired by multi-agent path-finding (MAPF) algorithms (in congested regions). In the first phase, SWARM-MAPF selects a leader among the agents and finds a path for it that is sufficiently far away from the obstacles so that the other agents can preserve the desired formation around it. It also identifies the critical segments of the leaderâ€™s path where the other agents cannot preserve the desired formation and the refinement of which has thus to be delegated to the second phase. In the second phase, SWARM-MAPF refines these segments. Theoretically, we prove that SWARM-MAPF is complete. Empirically, we show that SWARM-MAPF scales well and is able to find close-to-optimal solutions.
```
@inproceedings{LiAAMAS20,
 author = {Jiaoyang Li and Kexuan Sun and Hang Ma and Ariel Felner and T. K. Satish Kumar and Sven Koenig},
 booktitle = {International Conference on Autonomous Agents and Multiagent Systems},
 pages = {726--734},
 title = {Moving Agents in Formation in Congested Environments},
 year = {2020}
}
```
Idle Time Optimization for Target Assignment and Path Finding in Sortation Centers.
Ngai Meng Kou, Cheng Peng, Hang Ma, T. K. Satish Kumar and Sven Koenig.
AAAI Conference on Artificial Intelligence (AAAI), New York, NY, February 2020.
[ pdf] [ abstract] [ bibtex] [ media: Aliyun (Chinese)] [ media: Synced (Chinese)]
In this paper, we study the one-shot and lifelong versions of the Target Assignment and Path Finding problem in automated sortation centers, where each agent needs to constantly assign itself a sorting station, move to its assigned station without colliding with obstacles or other agents, wait in the queue of that station to obtain a parcel for delivery, and then deliver the parcel to a sorting bin. The throughput of such centers is largely determined by the total idle time of all stations since their queues can frequently become empty. To address this problem, we first formalize and study the oneshot version that assigns stations to a set of agents and finds collision-free paths for the agents to their assigned stations. We present efficient algorithms for this task based on a novel min-cost max-flow formulation that minimizes the total idle time of all stations in a fixed time window. We then demonstrate how our algorithms for solving the one-shot problem can be applied to solving the lifelong problem as well. Experimentally, we believe to be the first researchers to consider real-world automated sortation centers using an industrial simulator with realistic data and a kinodynamic model of real robots. On this simulator, we showcase the benefits of our algorithms by demonstrating their efficiency and effectiveness for up to 350 agents.
```
@inproceedings{KouAAAI20,
 author = {Ngai Meng Kou and Cheng Peng and Hang Ma and T. K. Satish Kumar and Sven Koenig},
 booktitle = {{AAAI} Conference on Artificial Intelligence},
 pages = {9925--9932},
 title = {Idle Time Optimization for Target Assignment and Path Finding in Sortation Centers},
 year = {2020}
}
```

2019

Improved Heuristics for Multi-Agent Path Finding with Conflict-Based Search.
Jiaoyang Li, Eli Boyarski, Ariel Felner, Hang Ma and Sven Koenig.
International Joint Conference on Artificial Intelligence (IJCAI), Macao, China, August 2019.
[ pdf] [ abstract] [ bibtex] [ github]
Conflict-Based Search (CBS) and its enhancements are among the strongest algorithms for Multi-Agent Path Finding. Recent work introduced an admissible heuristic to guide the high-level search of CBS. In this work, we prove the limitation of this heuristic, as it is based on cardinal conflicts only. We then introduce two new admissible heuristics by reasoning about the pairwise dependencies between agents. Empirically, CBS with either new heuristic significantly improves the success rate over CBS with the recent heuristic and reduces the number of expanded nodes and runtime by up to a factor of 50.
```
@inproceedings{LiIJCAI19,
 author = {Jiaoyang Li and Eli Boyarski and Ariel Felner and Hang Ma and Sven Koenig},
 booktitle = {International Joint Conference on Artificial Intelligence},
 pages = {442--449},
 title = {Improved Heuristics for Multi-Agent Path Finding with Conflict-Based Search},
 year = {2019}
}
```
Disjoint Splitting for Conflict-Based Search for Multi-Agent Path Finding.
Jiaoyang Li, Daniel Harabor, Peter J. Stuckey, Ariel Felner, Hang Ma and Sven Koenig.
International Conference on Automated Planning and Scheduling (ICAPS), Berkeley, CA, July 2019.
[ pdf] [ abstract] [ bibtex]
Multi-Agent Path Finding (MAPF) is the planning problem of finding collision-free paths for a team of agents. We focus on Conflict-Based Search (CBS), a two-level tree-search state-of-the-art MAPF algorithm. The standard splitting strategy used by CBS is not disjoint, i.e., when it splits a problem into two subproblems, some solutions are shared by both subproblems, which can create duplication of search effort. In this paper, we demonstrate how to improve CBS with disjoint splitting and how to modify the low-level search of CBS to take maximal advantage of it. Experiments show that disjoint splitting increases the success rates and speeds of CBS and its variants by up to 2 orders of magnitude.
```
@inproceedings{LiICAPS19,
 author = {Jiaoyang Li and Daniel Harabor and Peter J. Stuckey and Ariel Felner and Hang Ma and Sven Koenig},
 booktitle = {International Conference on Automated Planning and Scheduling},
 pages = {279--283},
 title = {Disjoint Splitting for Conflict-Based Search for Multi-Agent Path Finding},
 year = {2019}
}
```
Multi-Agent Pathfinding: Definitions, Variants, and Benchmarks.
Roni Stern, Nathan Sturtevant, Ariel Felner, Sven Koenig, Hang Ma, Thayne Walker, Jiaoyang Li, Dor Atzmon, Liron Cohen, T. K. Satish Kumar, Eli Boyarski and Roman Bartak.
International Symposium on Combinatorial Search (SoCS), Napa, CA, July 2019.
[ pdf] [ abstract] [ bibtex] (Position Paper)
The Multi-Agent Pathfinding (MAPF) problem is the fundamental problem of planning paths for multiple agents, where the key constraint is that the agents will be able to follow these paths concurrently without colliding with each other. Applications of MAPF include automated warehouses and autonomous vehicles. Research on MAPF has been flourishing in the past couple of years. Different MAPF research papers make different assumptions, e.g., whether agents can traverse the same road at the same time, and have different objective functions, e.g., minimize makespan or sum of agentsâ€™ actions costs. These assumptions and objectives are sometimes implicitly assumed or described informally. This makes it difficult to establish appropriate baselines for comparison in research papers, as well as making it difficult for practitioners to find the papers relevant to their concrete application. This paper aims to fill this gap and support researchers and practitioners by providing a unifying terminology for describing common MAPF assumptions and objectives. In addition, we also provide pointers to two MAPF benchmarks. In particular, we introduce a new grid-based benchmark for MAPF, and demonstrate experimentally that it poses a challenge to contemporary MAPF algorithms.
```
@inproceedings{SternSOCS19,
 author = {Roni Stern and Nathan Sturtevant and Ariel Felner and Sven Koenig and Hang Ma and Thayne Walker and Jiaoyang Li and Dor Atzmon and Liron Cohen and T. K. Satish Kumar and Eli Boyarski and Roman Bartak},
 booktitle = {International Symposium on Combinatorial Search},
 pages = {151--159},
 title = {Multi-Agent Pathfinding: Definitions, Variants, and Benchmarks},
 year = {2019}
}
```
Task and Path Planning for Multi-Agent Pickup and Delivery.
Minghua Liu, Hang Ma, Jiaoyang Li and Sven Koenig.
International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Montreal, Canada, May 2019.
[ pdf] [ abstract] [ bibtex]
We study the offline Multi-Agent Pickup-and-Delivery (MAPD) problem, where a team of agents has to execute a batch of tasks with release times in a known environment. To execute a task, an agent has to move first from its current location to the pickup location of the task and then to the delivery location of the task. The MAPD problem is to assign tasks to agents and plan collision-free paths for them to execute their tasks. Online MAPD algorithms can be applied to the offline MAPD problem, but do not utilize all of the available information and may thus not be effective. Therefore, we present two novel offline MAPD algorithms that improve a state-of-the-art online MAPD algorithm with respect to task planning, path planning, and deadlock avoidance for the offline MAPD problem. Our MAPD algorithms first compute one task sequence for each agent by solving a special traveling salesman problem and then plan paths according to these task sequences. We also introduce an effective deadlock avoidance method, called "reserving dummy paths." Theoretically, our MAPD algorithms are complete for well-formed MAPD instances, a realistic subclass of all MAPD instances. Experimentally, they produce solutions of smaller makespans and scale better than the online MAPD algorithm in simulated warehouses with hundreds of robots and thousands of tasks.
```
@inproceedings{LiuAAMAS19,
 author = {Minghua Liu and Hang Ma and Jiaoyang Li and Sven Koenig},
 booktitle = {International Conference on Autonomous Agents and Multiagent Systems},
 pages = {2253--2255},
 title = {Task and Path Planning for Multi-Agent Pickup and Delivery},
 year = {2019}
}
```
A New Constraint Satisfaction Perspective on Multi-Agent Path Finding.
Jiangxing Wang, Jiaoyang Li, Hang Ma, Sven Koenig and T. K. Satish Kumar.
International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Montreal, Canada, May 2019.
[ pdf] [ abstract] [ bibtex] (Extended Abstract)
In this paper, we adopt a new perspective on the Multi-Agent Path Finding (MAPF) problem and view it as a Constraint Satisfaction Problem (CSP). A variable corresponds to an agent, its domain is the set of all paths from the start vertex to the goal vertex of the agent, and the constraints allow only conflict-free paths for each pair of agents. Although the domains and constraints are only implicitly defined, this new CSP perspective allows us to use successful techniques from CSP search. With the concomitant idea of using matrix computations for calculating the size of the reduced domain of an uninstantiated variable, we apply Dynamic Variable Ordering and Rapid Random Restarts to the MAPF problem. In our experiments, the resulting simple polynomial-time MAPF solver, called Matrix MAPF solver, either outperforms or matches the performance of many state-of-the-art solvers for the MAPF problem and its variants.
```
@inproceedings{WangAAMAS19,
 author = {Jiangxing Wang and Jiaoyang Li and Hang Ma and Sven Koenig and T. K. Satish Kumar},
 booktitle = {International Conference on Autonomous Agents and Multiagent Systems},
 pages = {417--423},
 title = {A New Constraint Satisfaction Perspective on Multi-Agent Path Finding},
 year = {2019}
}
```
Searching with Consistent Prioritization for Multi-Agent Path Finding.
Hang Ma, Daniel Harabor, Peter J. Stuckey, Jiaoyang Li and Sven Koenig.
AAAI Conference on Artificial Intelligence (AAAI), Honolulu, HI, January 2019.
[ pdf] [ abstract] [ bibtex]
We study prioritized planning for Multi-Agent Path Finding (MAPF). Existing prioritized MAPF algorithms depend on rule-of-thumb heuristics and random assignment to determine a fixed total priority ordering of all agents a priori. We instead explore the space of all possible partial priority orderings as part of a novel systematic and conflict-driven combinatorial search framework. In a variety of empirical comparisons, we demonstrate state-of-the-art solution qualities and success rates, often with similar runtimes to existing algorithms. We also develop new theoretical results that explore the limitations of prioritized planning, in terms of completeness and optimality, for the first time.
```
@inproceedings{MaAAAI19a,
 author = {Hang Ma and Daniel Harabor and Peter J. Stuckey and Jiaoyang Li and Sven Koenig},
 booktitle = {{AAAI} Conference on Artificial Intelligence},
 pages = {7643--7650},
 title = {Searching with Consistent Prioritization for Multi-Agent Path Finding},
 year = {2019}
}
```
Lifelong Path Planning with Kinematic Constraints for Multi-Agent Pickup and Delivery.
Hang Ma, Wolfgang Hönig, T. K. Satish Kumar, Nora Ayanian and Sven Koenig.
AAAI Conference on Artificial Intelligence (AAAI), Honolulu, HI, January 2019.
[ pdf] [ abstract] [ bibtex] [ video] [ supp]
The Multi-Agent Pickup and Delivery (MAPD) problem models applications where a large number of agents attend to a stream of incoming pickup-and-delivery tasks. Token Passing (TP) is a recent MAPD algorithm that is efficient and effective. We make TP even more efficient and effective by using a novel combinatorial search algorithm, called Safe Interval Path Planning with Reservation Table (SIPPwRT), for single-agent path planning. SIPPwRT uses an advanced data structure that allows for fast updates and lookups of the current paths of all agents in an online setting. The resulting MAPD algorithm TP-SIPPwRT takes kinematic constraints of real robots into account directly during planning, computes continuous agent movements with given velocities that work on non-holonomic robots rather than discrete agent movements with uniform velocity, and is complete for well-formed MAPD instances. We demonstrate its benefits for automated warehouses using both an agent simulator and a standard robot simulator. For example, we demonstrate that it can compute paths for hundreds of agents and thousands of tasks in seconds and is more efficient and effective than existing MAPD algorithms that use a post-processing step to adapt their paths to continuous agent movements with given velocities.
```
@inproceedings{MaAAAI19b,
 author = {Hang Ma and Wolfgang H\"{o}nig and T. K. Satish Kumar and Nora Ayanian and Sven Koenig},
 booktitle = {{AAAI} Conference on Artificial Intelligence},
 pages = {7651--7658},
 title = {Lifelong Path Planning with Kinematic Constraints for Multi-Agent Pickup and Delivery},
 year = {2019}
}
```
Multi-Agent Path Finding for Large Agents.
Jiaoyang Li, Pavel Surynek, Ariel Felner, Hang Ma, T. K. Satish Kumar and Sven Koenig.
AAAI Conference on Artificial Intelligence (AAAI), Honolulu, HI, January 2019.
[ pdf] [ abstract] [ bibtex]
Multi-Agent Path Finding (MAPF) has been widely studied in the AI community. For example, Conflict-Based Search (CBS) is a state-of-the-art MAPF algorithm based on a twolevel tree-search. However, previous MAPF algorithms assume that an agent occupies only a single location at any given time, e.g., a single cell in a grid. This limits their applicability in many real-world domains that have geometric agents in lieu of point agents. Geometric agents are referred to as "large" agents because they can occupy multiple points at the same time. In this paper, we formalize and study LA-MAPF, i.e., MAPF for large agents. We first show how CBS can be adapted to solve LA-MAPF. We then present a generalized version of CBS, called Multi-Constraint CBS (MC-CBS), that adds multiple constraints (instead of one constraint) for an agent when it generates a high-level search node. We introduce three different approaches to choose such constraints as well as an approach to compute admissible heuristics for the high-level search. Experimental results show that all MC-CBS variants outperform CBS by up to three orders of magnitude in terms of runtime. The best variant also outperforms EPEA* (a state-of-the-art A*-based MAPF solver) in all cases and MDD-SAT (a state-of-the-art reduction-based MAPF solver) in some cases.
```
@inproceedings{LiAAAI19a,
 author = {Jiaoyang Li and Pavel Surynek and Ariel Felner and Hang Ma and T. K. Satish Kumar and Sven Koenig},
 booktitle = {{AAAI} Conference on Artificial Intelligence},
 pages = {7627--7634},
 title = {Multi-Agent Path Finding for Large Agents},
 year = {2019}
}
```
Symmetry-Breaking Constraints for Grid-Based Multi-Agent Path Finding.
Jiaoyang Li, Daniel Harabor, Peter J. Stuckey, Hang Ma and Sven Koenig.
AAAI Conference on Artificial Intelligence (AAAI), Honolulu, HI, January 2019.
[ pdf] [ abstract] [ bibtex] [ github]
We describe a new way of reasoning about symmetric collisions for Multi-Agent Path Finding (MAPF) on 4-neighbor grids. We also introduce a symmetry-breaking constraint to resolve these conflicts. This specialized technique allows us to identify and eliminate, in a single step, all permutations of two currently assigned but incompatible paths. Each such permutation has exactly the same cost as a current path, and each one results in a new collision between the same two agents. We show that the addition of symmetry-breaking techniques can lead to an exponential reduction in the size of the search space of CBS, a popular framework for MAPF, and report significant improvements in both runtime and success rate versus CBSH and EPEA* â€“ two recent and state-of-the-art MAPF algorithms.
```
@inproceedings{LiAAAI19b,
 author = {Jiaoyang Li and Daniel Harabor and Peter J. Stuckey and Hang Ma and Sven Koenig},
 booktitle = {{AAAI} Conference on Artificial Intelligence},
 pages = {6087--6095},
 title = {Symmetry-Breaking Constraints for Grid-Based Multi-Agent Path Finding},
 year = {2019}
}
```
ACM SIGAI Activity Report.
Sven Koenig, Sanmay Das, Rosemary D. Paradis, John P. Dickerson, Yolanda Gil, Katherine Guo, Benjamin Kuipers, Iolanda Leite, Hang Ma, Nicholas Mattei, Amy McGovern, Larry Medsker, Todd W. Neller, Marion Neumann, Plamen Petrov, Michael Rovatsos and David G. Stork.
AI Matters 5(3), 2019.
[ pdf] [ abstract] [ bibtex]
We are happy to present the annual activity report of the ACM Special Interest Group on AI (ACM SIGAI), covering the period from July 2018 to June 2019.
```
@article{KoenigAIMATTERS19,
 author = {Sven Koenig and Sanmay Das and Rosemary D. Paradis and John P. Dickerson and Yolanda Gil and Katherine Guo and Benjamin Kuipers and Iolanda Leite and Hang Ma and Nicholas Mattei and Amy McGovern and Larry Medsker and Todd W. Neller and Marion Neumann and Plamen Petrov and Michael Rovatsos and David G. Stork},
 journal = {{AI} Matters},
 number = {3},
 pages = {6--11},
 title = {{ACM} {SIGAI} Activity Report},
 volume = {5},
 year = {2019}
}
```

2018

Multi-Agent Path Finding with Deadlines.
Hang Ma, Glenn Wagner, Ariel Felner, Jiaoyang Li, T. K. Satish Kumar and Sven Koenig.
International Joint Conference on Artificial Intelligence (IJCAI), Stockholm, Sweden, July 2018.
[ pdf] [ abstract] [ bibtex]
We formalize Multi-Agent Path Finding with Deadlines (MAPF-DL). The objective is to maximize the number of agents that can reach their given goal vertices from their given start vertices within the deadline, without colliding with each other. We first show that MAPF-DL is NP-hard to solve optimally. We then present two classes of optimal algorithms, one based on a reduction of MAPF-DL to a flow problem and a subsequent compact integer linear programming formulation of the resulting reduced abstracted multi-commodity flow network and the other one based on novel combinatorial search algorithms. Our empirical results demonstrate that these MAPF-DL solvers scale well and each one dominates the other ones in different scenarios.
```
@inproceedings{MaIJCAI18,
 author = {Hang Ma and Glenn Wagner and Ariel Felner and Jiaoyang Li and T. K. Satish Kumar and Sven Koenig},
 booktitle = {International Joint Conference on Artificial Intelligence},
 pages = {417--423},
 title = {Multi-Agent Path Finding with Deadlines},
 year = {2018}
}
```
Anytime Focal Search with Applications.
Liron Cohen, Matías Greco, Hang Ma, Carlos Hernandez, Ariel Felner, T. K. Satish Kumar and Sven Koenig.
International Joint Conference on Artificial Intelligence (IJCAI), Stockholm, Sweden, July 2018.
[ pdf] [ abstract] [ bibtex]
Focal search (FS) is a bounded-suboptimal search (BSS) variant of A*. Like A*, it uses an open list whose states are sorted in increasing order of their f-values. Unlike A*, it also uses a focal list containing all states from the open list whose f-values are no larger than a suboptimality factor times the smallest f-value in the open list. In this paper, we develop an anytime version of FS, called anytime FS (AFS), that is useful when deliberation time is limited. AFS finds a "good" solution quickly and refines it to better and better solutions if time allows. It does this refinement efficiently by reusing previous search efforts. On the theoretical side, we show that AFS is bounded suboptimal and that anytime potential search (ATPS/ANA*), a state-of-theart anytime bounded-cost search (BCS) variant of A*, is a special case of AFS. In doing so, we bridge the gap between anytime search algorithms based on BSS and BCS. We also identify different properties of priority functions, used to sort the focal list, that may allow for efficient reuse of previous search efforts. On the experimental side, we demonstrate the usefulness of AFS for solving hard combinatorial problems, such as the generalized covering traveling salesman problem and the multi-agent pathfinding problem.
```
@inproceedings{CohenIJCAI18,
 author = {Liron Cohen and Mat\'{i}as Greco and Hang Ma and Carlos Hernandez and Ariel Felner and T. K. Satish Kumar and Sven Koenig},
 booktitle = {International Joint Conference on Artificial Intelligence},
 pages = {1434--1441},
 title = {Anytime Focal Search with Applications},
 year = {2018}
}
```
Multi-Agent Path Finding with Deadlines: Preliminary Results.
Hang Ma, Glenn Wagner, Ariel Felner, Jiaoyang Li, T. K. Satish Kumar and Sven Koenig.
International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Stockholm, Sweden, July 2018.
[ pdf] [ abstract] [ bibtex] (Extended Abstract)
We formalize the problem of multi-agent path finding with deadlines (MAPF-DL). The objective is to maximize the number of agents that can reach their given goal vertices from their given start vertices within a given deadline, without colliding with each other. We first show that the MAPF-DL problem is NP-hard to solve optimally. We then present an optimal MAPF-DL algorithm based on a reduction of the MAPF-DL problem to a flow problem and a subsequent compact integer linear programming formulation of the resulting reduced abstracted multi-commodity flow network.
```
@inproceedings{MaAAMAS18,
 author = {Hang Ma and Glenn Wagner and Ariel Felner and Jiaoyang Li and T. K. Satish Kumar and Sven Koenig},
 booktitle = {International Conference on Autonomous Agents and Multiagent Systems},
 pages = {2004--2006},
 title = {Multi-Agent Path Finding with Deadlines: Preliminary Results},
 year = {2018}
}
```
Adding Heuristics to Conflict-Based Search for Multi-Agent Pathfinding.
Ariel Felner, Jiaoyang Li, Eli Boyarski, Hang Ma, Liron Cohen, T. K. Satish Kumar and Sven Koenig.
International Conference on Automated Planning and Scheduling (ICAPS), Delft, Netherlands, June 2018.
[ pdf] [ abstract] [ bibtex] [ github]
Conflict-Based Search (CBS) and its enhancements are among the strongest algorithms for the multi-agent pathfinding problem. However, existing variants of CBS do not use any heuristics that estimate future work. In this paper, we introduce different admissible heuristics for CBS by aggregating cardinal conflicts among agents. In our experiments, CBS with these heuristics outperforms previous state-of-theart CBS variants by up to a factor of five.
```
@inproceedings{FelnerICAPS18,
 author = {Ariel Felner and Jiaoyang Li and Eli Boyarski and Hang Ma and Liron Cohen and T. K. Satish Kumar and Sven Koenig},
 booktitle = {International Conference on Automated Planning and Scheduling},
 pages = {83--87},
 title = {Adding Heuristics to Conflict-Based Search for Multi-Agent Pathfinding},
 year = {2018}
}
```

ACM SIGAI Activity Report.
Sven Koenig, Sanmay Das, Rosemary D. Paradis, John P. Dickerson, Yolanda Gil, Katherine Guo, Benjamin Kuipers, Hang Ma, Nicholas Mattei, Amy McGovern, Larry Medsker, Todd W. Neller, Plamen Petrov, Michael Rovatsos and David G. Stork.
AI Matters 4(3), 2018.
[ pdf] [ abstract] [ bibtex]

We are happy to present the annual activity report of ACM SIGAI, covering the period from July 2017 to June 2018.

@article{KoenigAIMATTERS18,
 author = {Sven Koenig and Sanmay Das and Rosemary D. Paradis and John P. Dickerson and Yolanda Gil and Katherine Guo and Benjamin Kuipers and Hang Ma and Nicholas Mattei and Amy McGovern and Larry Medsker and Todd W. Neller and Plamen Petrov and Michael Rovatsos and David G. Stork},
 journal = {{AI} Matters},
 number = {3},
 pages = {7--11},
 title = {{ACM} {SIGAI} Activity Report},
 volume = {4},
 year = {2018}
}

2017

Feasibility Study: Moving Non-Homogeneous Teams in Congested Video Game Environments.
Hang Ma, Jingxing Yang, Liron Cohen, T. K. Satish Kumar and Sven Koenig.
AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE), Snowbird, UT, October 2017.
[ pdf] [ abstract] [ bibtex] [ demo] (Demo Track)
Multi-agent path finding (MAPF) is a well-studied problem in artificial intelligence, where one needs to find collisionfree paths for agents with given start and goal locations. In video games, agents of different types often form teams. In this paper, we demonstrate the usefulness of MAPF algorithms from artificial intelligence for moving such non-homogeneous teams in congested video game environments.
```
@inproceedings{MaAIIDE17,
 author = {Hang Ma and Jingxing Yang and Liron Cohen and T. K. Satish Kumar and Sven Koenig},
 booktitle = {AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment},
 pages = {270--272},
 title = {Feasibility Study: Moving Non-Homogeneous Teams in Congested Video Game Environments},
 year = {2017}
}
```
Summary: Multi-Agent Path Finding with Kinematic Constraints.
Wolfgang Hönig, T. K. Satish Kumar, Liron Cohen, Hang Ma, Hong Xu, Nora Ayanian and Sven Koenig.
International Joint Conference on Artificial Intelligence (IJCAI), Melbourne, Australia, October 2017.
[ pdf] [ abstract] [ bibtex] (Sister Conference Best Paper Track)
Multi-Agent Path Finding (MAPF) is well studied in both AI and robotics. Given a discretized environment and agents with assigned start and goal locations, MAPF solvers from AI find collision-free paths for hundreds of agents with user-provided sub-optimality guarantees. However, they ignore that actual robots are subject to kinematic constraints (such as velocity limits) and suffer from imperfect plan-execution capabilities. We therefore introduce MAPF-POST to postprocess the output of a MAPF solver in polynomial time to create a plan-execution schedule that can be executed on robots. This schedule works on non-holonomic robots, considers kinematic constraints, provides a guaranteed safety distance between robots, and exploits slack to avoid time-intensive replanning in many cases. We evaluate MAPF-POST in simulation and on differential-drive robots, showcasing the practicality of our approach.
```
@inproceedings{HoenigIJCAI17,
 author = {Wolfgang H\"{o}nig and T. K. Satish Kumar and Liron Cohen and Hang Ma and Hong Xu and Nora Ayanian and Sven Koenig},
 booktitle = {International Joint Conference on Artificial Intelligence},
 pages = {4869--4873},
 title = {Summary: Multi-Agent Path Finding with Kinematic Constraints},
 year = {2017}
}
```
Lifelong Multi-Agent Path Finding for Online Pickup and Delivery Tasks.
Hang Ma, Jiaoyang Li, T. K. Satish Kumar and Sven Koenig.
International Conference on Autonomous Agents and Multiagent Systems (AAMAS), São Paulo, Brazil, May 2017.
[ pdf] [ abstract] [ bibtex] [ slides]
The multi-agent path-finding (MAPF) problem has recently received a lot of attention. However, it does not capture important characteristics of many real-world domains, such as automated warehouses, where agents are constantly engaged with new tasks. In this paper, we therefore study a lifelong version of the MAPF problem, called the multi-agent pickup and delivery (MAPD) problem. In the MAPD problem, agents have to attend to a stream of delivery tasks in an online setting. One agent has to be assigned to each delivery task. This agent has to first move to a given pickup location and then to a given delivery location while avoiding collisions with other agents. We present two decoupled MAPD algorithms, Token Passing (TP) and Token Passing with Task Swaps (TPTS). Theoretically, we show that they solve all well-formed MAPD instances, a realistic subclass of MAPD instances. Experimentally, we compare them against a centralized strawman MAPD algorithm without this guarantee in a simulated warehouse system. TP can easily be extended to a fully distributed MAPD algorithm and is the best choice when real-time computation is of primary concern since it remains efficient for MAPD instances with hundreds of agents and tasks. TPTS requires limited communication among agents and balances well between TP and the centralized MAPD algorithm.
```
@inproceedings{MaAAMAS17,
 author = {Hang Ma and Jiaoyang Li and T. K. Satish Kumar and Sven Koenig},
 booktitle = {International Conference on Autonomous Agents and Multiagent Systems},
 pages = {837--845},
 title = {Lifelong Multi-Agent Path Finding for Online Pickup and Delivery Tasks},
 year = {2017}
}
```
Multi-Agent Path Finding with Delay Probabilities.
Hang Ma, T. K. Satish Kumar and Sven Koenig.
AAAI Conference on Artificial Intelligence (AAAI), San Francisco, CA, February 2017.
[ pdf] [ abstract] [ bibtex]
Several recently developed Multi-Agent Path Finding (MAPF) solvers scale to large MAPF instances by searching for MAPF plans on 2 levels: The high-level search resolves collisions between agents, and the low-level search plans paths for single agents under the constraints imposed by the high-level search. We make the following contributions to solve the MAPF problem with imperfect plan execution with small average makespans: First, we formalize the MAPF Problem with Delay Probabilities (MAPF-DP), define valid MAPF-DP plans and propose the use of robust plan-execution policies for valid MAPF-DP plans to control how each agent proceeds along its path. Second, we discuss 2 classes of decentralized robust plan-execution policies (called Fully Synchronized Policies and Minimal Communication Policies) that prevent collisions during plan execution for valid MAPF-DP plans. Third, we present a 2-level MAPF-DP solver (called Approximate Minimization in Expectation) that generates valid MAPF-DP plans.
```
@inproceedings{MaAAAI17,
 author = {Hang Ma and T. K. Satish Kumar and Sven Koenig},
 booktitle = {{AAAI} Conference on Artificial Intelligence},
 pages = {3605--3612},
 title = {Multi-Agent Path Finding with Delay Probabilities},
 year = {2017}
}
```

Overview: A Hierarchical Framework for Plan Generation and Execution in Multi-Robot Systems.
Hang Ma, Wolfgang Hönig, Liron Cohen, Tansel Uras, Hong Xu, T. K. Satish Kumar, Nora Ayanian and Sven Koenig.
IEEE Intelligent Systems 32(6), 2017.
[ pdf] [ abstract] [ bibtex] [ video] [ publisher]

@article{MaIEEE17,
 author = {Hang Ma and Wolfgang H\"{o}nig and Liron Cohen and Tansel Uras and Hong Xu and T. K. Satish Kumar and Nora Ayanian and Sven Koenig},
 journal = {IEEE Intelligent Systems},
 number = {6},
 pages = {6-12},
 title = {Overview: A Hierarchical Framework for Plan Generation and Execution in Multi-Robot Systems},
 volume = {32},
 year = {2017}
}

AI Buzzwords Explained: Multi-Agent Path Finding (MAPF).
Hang Ma and Sven Koenig.
AI Matters 3(3), 2017.
[ pdf] [ abstract] [ bibtex]
A framework for coordinating taskand motion-level operations for many robots uses informed search to find causally feasible solutions and simple temporal networks to include kinematic constraints and react to dynamic changes.
```
@article{MaAIMATTERS17,
 author = {Hang Ma and Sven Koenig},
 journal = {AI Matters},
 number = {3},
 pages = {15--19},
 title = {{AI} Buzzwords Explained: Multi-Agent Path Finding ({MAPF})},
 volume = {3},
 year = {2017}
}
```

2016

Formation Change for Robot Groups in Occluded Environments.
Wolfgang Hönig, T. K. Satish Kumar, Hang Ma, Nora Ayanian and Sven Koenig.
IEEE/RSJ International Conference on Intelligent Robots and System (IROS), Daejeon, Korea, October 2016.
[ pdf] [ abstract] [ bibtex] [ video] [ publisher] [ project]
Path planning for multiple robots is well studied in the AI and robotics communities. For a given discretized environment, robots need to find collision-free paths to a set of specified goal locations. Robots can be fully anonymous, non-anonymous, or organized in groups. Although powerful solvers for this abstract problem exist, they make simplifying assumptions by ignoring kinematic constraints, making it difficult to use the resulting plans on actual robots. In this paper, we present a solution which takes kinematic constraints, such as maximum velocities, into account, while guaranteeing a user-specified minimum safety distance between robots. We demonstrate our approach in simulation and on real robots in 2D and 3D environments.
```
@inproceedings{HoenigIROS16,
 author = {Wolfgang H\"{o}nig and T. K. Satish Kumar and Hang Ma and Nora Ayanian and Sven Koenig},
 booktitle = {{IEEE/RSJ} International Conference on Intelligent Robots and System},
 pages = {4836--4842},
 title = {Formation Change for Robot Groups in Occluded Environments},
 year = {2016}
}
```
Overview: Generalizations of Multi-Agent Path Finding to Real-World Scenarios.
Hang Ma, Sven Koenig, Nora Ayanian, Liron Cohen, Wolfgang Hönig, T. K. Satish Kumar, Tansel Uras, Hong Xu, C. Tovey and G. Sharon.
IJCAI-16 Workshop on Multi-Agent Path Finding (WoMAPF), New York City, NY, July 2016.
[ pdf] [ abstract] [ bibtex] [ Synced (Chinese)]
Multi-agent path finding (MAPF) is well-studied in artificial intelligence, robotics, theoretical computer science and operations research. We discuss issues that arise when generalizing MAPF methods to real-world scenarios and four research directions that address them. We emphasize the importance of addressing these issues as opposed to developing faster methods for the standard formulation of the MAPF problem.
```
@inproceedings{MaWOMAPF16,
 author = {Hang Ma and Sven Koenig and Nora Ayanian and Liron Cohen and Wolfgang H\"{o}nig and T. K. Satish Kumar and Tansel Uras and Hong Xu and C. Tovey and G. Sharon},
 booktitle = {{IJCAI}-16 Workshop on Multi-Agent Path Finding},
 title = {Overview: Generalizations of Multi-Agent Path Finding to Real-World Scenarios},
 year = {2016}
}
```
Multi-Agent Path Finding with Kinematic Constraints.
Wolfgang Hönig, T. K. Satish Kumar, Liron Cohen, Hang Ma, Hong Xu, Nora Ayanian and Sven Koenig.
International Conference on Automated Planning and Scheduling (ICAPS), London, United Kingdom, June 2016.
[ pdf] [ abstract] [ bibtex] [ video] [ project]
Outstanding Paper Award in the Robotics Track of ICAPS 2016
Multi-Agent Path Finding (MAPF) is well studied in both AI and robotics. Given a discretized environment and agents with assigned start and goal locations, MAPF solvers from AI find collision-free paths for hundreds of agents with user-provided sub-optimality guarantees. However, they ignore that actual robots are subject to kinematic constraints (such as finite maximum velocity limits) and suffer from imperfect plan-execution capabilities. We therefore introduce MAPF-POST, a novel approach that makes use of a simple temporal network to postprocess the output of a MAPF solver in polynomial time to create a plan-execution schedule that can be executed on robots. This schedule works on non-holonomic robots, takes their maximum translational and rotational velocities into account, provides a guaranteed safety distance between them, and exploits slack to absorb imperfect plan executions and avoid time-intensive replanning in many cases. We evaluate MAPF-POST in simulation and on differential-drive robots, showcasing the practicality of our approach.
```
@inproceedings{HoenigICAPS16,
 author = {Wolfgang H\"{o}nig and T. K. Satish Kumar and Liron Cohen and Hang Ma and Hong Xu and Nora Ayanian and Sven Koenig},
 booktitle = {International Conference on Automated Planning and Scheduling},
 pages = {477--485},
 title = {Multi-Agent Path Finding with Kinematic Constraints},
 year = {2016}
}
```
Optimal Target Assignment and Path Finding for Teams of Agents.
Hang Ma and Sven Koenig.
International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Singapore, May 2016.
[ pdf] [ abstract] [ bibtex] [ slides]
We study the TAPF (combined target-assignment and pathfinding) problem for teams of agents in known terrain, which generalizes both the anonymous and non-anonymous multi-agent path-finding problems. Each of the teams is given the same number of targets as there are agents in the team. Each agent has to move to exactly one target given to its team such that all targets are visited. The TAPF problem is to first assign agents to targets and then plan collisionfree paths for the agents to their targets in a way such that the makespan is minimized. We present the CBM (Conflict-Based Min-Cost-Flow) algorithm, a hierarchical algorithm that solves TAPF instances optimally by combining ideas from anonymous and non-anonymous multi-agent pathfinding algorithms. On the low level, CBM uses a mincost max-flow algorithm on a time-expanded network to assign all agents in a single team to targets and plan their paths. On the high level, CBM uses conflict-based search to resolve collisions among agents in different teams. Theoretically, we prove that CBM is correct, complete and optimal. Experimentally, we show the scalability of CBM to TAPF instances with dozens of teams and hundreds of agents and adapt it to a simulated warehouse system.
```
@inproceedings{MaAAMAS16,
 author = {Hang Ma and Sven Koenig},
 booktitle = {International Conference on Autonomous Agents and Multiagent Systems},
 pages = {1144--1152},
 title = {Optimal Target Assignment and Path Finding for Teams of Agents},
 year = {2016}
}
```
Path Planning with Kinematic Constraints for Robot Groups.
Wolfgang Hönig, T. K. Satish Kumar, Liron Cohen, Hang Ma, Sven Koenig and Nora Ayanian.
Southern California Robotics Symposium (SCR), San Diego, CA, April 2016.
[ pdf] [ abstract] [ bibtex] [ poster]
Path planning for multiple robots is well studied in the AI and robotics communities. For a given discretized environment, robots need to find collision-free paths to a set of specified goal locations. Robots can be fully anonymous, non-anonymous, or organized in groups. Although powerful solvers for this abstract problem exist, they make simplifying assumptions by ignoring kinematic constraints, making it difficult to use the resulting plans on actual robots. In this paper, we present a solution which takes kinematic constraints, such as maximum velocities, into account, while guaranteeing a user-specified minimum safety distance between robots. We demonstrate our approach in simulation and on real robots in 2D and 3D environments.
```
@inproceedings{HoenigSCR16,
 author = {Wolfgang H\"{o}nig and T. K. Satish Kumar and Liron Cohen and Hang Ma and Sven Koenig and Nora Ayanian},
 booktitle = {Southern California Robotics Symposium},
 title = {Path Planning with Kinematic Constraints for Robot Groups},
 year = {2016}
}
```
Multi-Agent Path Finding with Payload Transfers and the Package-Exchange Robot-Routing Problem.
Hang Ma, Craig Tovey, Guni Sharon, T. K. Satish Kumar and Sven Koenig.
AAAI Conference on Artificial Intelligence (AAAI), Phoenix, AZ, February 2016.
[ pdf] [ abstract] [ bibtex]
We study transportation problems where robots have to deliver packages and can transfer the packages among each other. Specifically, we study the package-exchange robot-routing problem (PERR), where each robot carries one package, any two robots in adjacent locations can exchange their packages, and each package needs to be delivered to a given destination. We prove that exchange operations make all PERR instances solvable. Yet, we also show that PERR is NP-hard to approximate within any factor less than 4/3 for makespan minimization and is NP-hard to solve for flowtime minimization, even when there are only two types of packages. Our proof techniques also generate new insights into other transportation problems, for example, into the hardness of approximating optimal solutions to the standard multi-agent path-finding problem (MAPF). Finally, we present optimal and suboptimal PERR solvers that are inspired by MAPF solvers, namely a flow-based ILP formulation and an adaptation of conflict-based search. Our empirical results demonstrate that these solvers scale well and that PERR instances often have smaller makespans and flowtimes than the corresponding MAPF instances.
```
@inproceedings{MaAAAI16,
 author = {Hang Ma and Craig Tovey and Guni Sharon and T. K. Satish Kumar and Sven Koenig},
 booktitle = {{AAAI} Conference on Artificial Intelligence},
 pages = {3166--3173},
 title = {Multi-Agent Path Finding with Payload Transfers and the Package-Exchange Robot-Routing Problem},
 year = {2016}
}
```
Planning, Scheduling and Monitoring for Airport Surface Operations.
Robert Morris, Corina Pasareanu, Kasper Luckow, Waqar Malik, Hang Ma, T. K. Satish Kumar and Sven Koenig.
AAAI-16 Workshop on Planning for Hybrid Systems (PlanHS), Phoenix, AZ, February 2016.
[ pdf] [ abstract] [ bibtex]
This paper explores the problem of managing movements of aircraft along the surface of busy airports. Airport surface management is a complex logistics problem involving the coordination of humans and machines. The work described here arose from the idea that autonomous towing vehicles for taxiing aircraft could offer a solution to the â€™capacity problemâ€™ for busy airports, the problem of getting more efficient use of existing surface area to meet increasing demand. Supporting autonomous surface operations requires continuous planning, scheduling and monitoring of operations, as well as systems for optimizing complex human-machine interaction. We identify a set of computational subproblems of the surface management problem that would benefit from recent advances in multi-agent planning and scheduling and probabilistic predictive modeling, and discuss preliminary work at integrating these components into a prototype of a surface management system.
```
@inproceedings{MorrisPLANHS16,
 author = {Robert Morris and Corina Pasareanu and Kasper Luckow and Waqar Malik and Hang Ma and T. K. Satish Kumar and Sven Koenig},
 booktitle = {{AAAI}-16 Workshop on Planning for Hybrid Systems},
 pages = {608--614},
 title = {Planning, Scheduling and Monitoring for Airport Surface Operations},
 year = {2016}
}
```

2015

Information Gathering and Reward Exploitation of Subgoals for POMDPs.
Hang Ma and Joelle Pineau.
AAAI Conference on Artificial Intelligence (AAAI), Austin, TX, January 2015.
[ pdf] [ abstract] [ bibtex] [ slides]
Planning in large partially observable Markov decision processes (POMDPs) is challenging especially when a long planning horizon is required. A few recent algorithms successfully tackle this case but at the expense of a weaker information-gathering capacity. In this paper, we propose Information Gathering and Reward Exploitation of Subgoals (IGRES), a randomized POMDP planning algorithm that leverages information in the state space to automatically generate "macro-actions" to tackle tasks with long planning horizons, while locally exploring the belief space to allow effective information gathering. Experimental results show that IGRES is an effective multi-purpose POMDP solver, providing state-of-the-art performance for both long horizon planning tasks and information-gathering tasks on benchmark domains. Additional experiments with an ecological adaptive management problem indicate that IGRES is a promising tool for POMDP planning in real-world settings.
```
@inproceedings{MaAAAI15,
 author = {Hang Ma and Joelle Pineau},
 booktitle = {{AAAI} Conference on Artificial Intelligence},
 pages = {3320--3326},
 title = {Information Gathering and Reward Exploitation of Subgoals for {POMDPs}},
 year = {2015}
}
```

Many publishers do not want authors to make their papers available electronically after the papers have been published. Please use the electronic versions provided here only if hardcopies are not yet available. If you have comments on any of these papers, please send me an email! Also, please send me your papers if we have common interests.