TY - GEN
T1 - Learning Robot Swarm Tactics over Complex Adversarial Environments
AU - Behjat, Amir
AU - Manjunatha, Hemanth
AU - Kumar, Prajit Krisshna
AU - Jani, Apurv
AU - Collins, Leighton
AU - Ghassemi, Payam
AU - Distefano, Joseph
AU - Doermann, David
AU - Dantu, Karthik
AU - Esfahani, Ehsan
AU - Chowdhury, Souma
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - To accomplish complex swarm robotic missions in the real world, one needs to plan and execute a combination of single robot behaviors, group primitives such as task allocation, path planning, and formation control, and mission-specific objectives such as target search and group coverage. Most such missions are designed manually by teams of robotics experts. Recent work in automated approaches to learning swarm behavior has been limited to individual primitives with sparse work on learning complete missions. This paper presents a systematic approach to learn tactical mission-specific policies that compose primitives in a swarm to accomplish the mission efficiently using neural networks with special input and output encoding. To learn swarm tactics in an adversarial environment, we employ a combination of 1) map-to-graph abstraction, 2) input/output encoding via Pareto filtering of points of interest and clustering of robots, and 3) learning via neuroevolution and policy gradient approaches. We illustrate this combination as critical to providing tractable learning, especially given the computational cost of simulating swarm missions of this scale and complexity. Successful mission completion outcomes are demonstrated with up to 60 robots. In addition, a close match in the performance statistics in training and testing scenarios shows the potential generalizability of the proposed framework.
AB - To accomplish complex swarm robotic missions in the real world, one needs to plan and execute a combination of single robot behaviors, group primitives such as task allocation, path planning, and formation control, and mission-specific objectives such as target search and group coverage. Most such missions are designed manually by teams of robotics experts. Recent work in automated approaches to learning swarm behavior has been limited to individual primitives with sparse work on learning complete missions. This paper presents a systematic approach to learn tactical mission-specific policies that compose primitives in a swarm to accomplish the mission efficiently using neural networks with special input and output encoding. To learn swarm tactics in an adversarial environment, we employ a combination of 1) map-to-graph abstraction, 2) input/output encoding via Pareto filtering of points of interest and clustering of robots, and 3) learning via neuroevolution and policy gradient approaches. We illustrate this combination as critical to providing tractable learning, especially given the computational cost of simulating swarm missions of this scale and complexity. Successful mission completion outcomes are demonstrated with up to 60 robots. In addition, a close match in the performance statistics in training and testing scenarios shows the potential generalizability of the proposed framework.
UR - https://www.scopus.com/pages/publications/85116619324
U2 - 10.1109/MRS50823.2021.9620707
DO - 10.1109/MRS50823.2021.9620707
M3 - Conference contribution
AN - SCOPUS:85116619324
T3 - 2021 International Symposium on Multi-Robot and Multi-Agent Systems, MRS 2021
SP - 83
EP - 91
BT - 2021 International Symposium on Multi-Robot and Multi-Agent Systems, MRS 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 International Symposium on Multi-Robot and Multi-Agent Systems, MRS 2021
Y2 - 4 November 2021 through 5 November 2021
ER -