Toward Conflict Resolution with Deep Multi-Agent Reinforcement Learning

Ralvi Isufaj, David Aranega Sebastia, Miquel Angel Piera

Producció científica: Contribució a revistaArticleRecercaAvaluat per experts

10 Cites (Scopus)

Resum

Safety in air traffic management at the tactical level is ensured by human controllers. Automatic detection and resolution tools are one way to assist controllers in their tasks. However, the majority of existing methods do not account for factors that can affect the quality and efficiency of resolutions. Furthermore, future challenges such as sustainability and the environmental impact of aviation must be tackled. In this work, we propose an innovative approach to pairwise conflict resolution, by modeling it as a multi-agent reinforcement learning to improve the quality of resolutions based on a combination of several factors. We use multi-agent deep deterministic policy gradient to generate resolution maneuvers. We propose a reward function that besides solving the conflicts attempts to optimize the resolutions in terms of time, fuel consumption, and airspace complexity. The models are evaluated on real traffic, with a data augmentation technique utilized to increase the variance of conflict geometries. We achieve promising results with a resolution rate of 93%, without the agents having any previous knowledge of the dynamics of the environment. Furthermore, the agents seem to be able to learn some desirable behaviors such as preferring small heading changes to solve conflicts in one time step. Nevertheless, the nonstationarity of the environment makes the learning procedure nontrivial. We argue ways that tangible qualities such as resolution rate and intangible qualities such as resolution acceptability and explainability can be improved.

Idioma originalAnglès
Pàgines (de-a)71-80
Nombre de pàgines10
RevistaJournal of Air Transportation
Volum30
Número3
DOIs
Estat de la publicacióPublicada - 2022

Fingerprint

Navegar pels temes de recerca de 'Toward Conflict Resolution with Deep Multi-Agent Reinforcement Learning'. Junts formen un fingerprint únic.

Com citar-ho