¸£Àû¼§

Hierarchical Mission Planning for Autonomous Agents in Uncertain Environments

En man på toppen av ett militärplan. Photographer: Linus Haegermark @Saab, Copyright Saab AB

In collaboration with Saab Aeronautics, we develop methods for dynamic and intelligent decision-making for autonomous agents by combining optimisation methods with learning-based approaches.

"Plans are useless, but planning is indispensable." In highly uncertain environments, the ability to reiterate the initial plan is crucial since unexpected events are certain to occur.

Autonomous agents such as Unmanned (Combat) Aerial Vehicles, or U(C)AVs for short, are aircraft capable of performing missions without direct human control. These versatile platforms are invaluable in both military and civilian applications, particularly in adverse environments. However, the key challenge lies in enabling these agents to make quick, intelligent decisions during missions.

Uniting learning-based methods and mathematical optimisation

Artificial intelligence, particularly reinforcement learning (RL), offers significant potential for adaptability, optimisation, and autonomous operation. RL methods learn from environmental interactions, but they face challenges in asymmetric cases involving heterogeneous agents. The diversity of agents increases complexity, making it difficult to learn effective strategies and leading to scalability issues. Additionally, RL's performance is heavily dependent on training data, which can be difficult, expensive, and dangerous to collect in real-world scenarios. Simulations are typically used for training, but differences between simulated and real environments can cause performance drops after deployment. Therefore, agents need to adapt efficiently to new settings, which can be challenging in complex and unpredictable situations.

Operations research methods, such as mathematical optimisation and heuristics, do not require agent training and need less data. These models can be constructed to be robust to uncertainties and variations in the environment, and they are easily designed to handle a varying number of heterogeneous agents. However, optimisation-based methods are typically not suited for highly dynamic and uncertain environments. While optimisation provides interpretability, data-driven black-box methods are easier to implement as they do not require understanding internal mechanics.

Combining RL and optimisation methods can leverage RL's adaptability in dynamic environments with the robustness and interpretability of optimisation methods, creating more effective and scalable solutions for complex missions.

A hierarchical mission framework

Flygplan. Photographer: Saab
By combining learning-based methods with optimisation approaches to create anytime hybrid methods, the resulting algorithms can enable quicker decision-making and execution, which is crucial for safety in time-sensitive operations.

In uncertain environments, dynamic real-time replanning is crucial. For multi-agent missions, this task easily becomes complex. A hierarchical model approach can be used, which divides the planning into two categories: high-level and low-level. The high-level method plans for a longer time horizon by performing resource allocation and scheduling and adapting the initial plan as necessary. The low-level method decides how tasks should be executed, with a shorter time horizon than the high level. RL-based methods are effective for low-level planning, while optimisation-based methods provide a better long-term perspective for high-level planning, especially when training data is limited.

Contact

Project publications

Search results can unfortunately not be shown now, please return later

Organisation

More about research at LiU