TY - GEN
T1 - Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement Learning
AU - Wang, Xueyuan
AU - Cenk Gursoy, M.
AU - Erpek, Tugba
AU - Sagduyu, Yalin E.
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/6
Y1 - 2021/6
N2 - Unmanned aerial vehicles (UAVs) are expected to be an integral part of wireless networks. In this paper, we aim to find collision-free paths for multiple cellular-connected UAVs, while satisfying requirements of connectivity with ground base stations (GBSs) in the presence of a dynamic jammer. We first formulate the problem as a sequential decision making problem in discrete domain, with connectivity, collision avoidance, and kinematic constraints. We, then, propose an offline temporal difference (TD) learning algorithm with online signal-to-interference-plus-noise ratio (SINR) mapping to solve the problem. More specifically, a value network is constructed and trained offline by TD method to encode the interactions among the UAVs and between the UAVs and the environment; and an online SINR mapping deep neural network (DNN) is designed and trained by supervised learning, to encode the influence and changes due to the jammer. Numerical results show that, without any information on the jammer, the proposed algorithm can achieve performance levels close to that of the ideal scenario with the perfect SINR-map. Real-time navigation for multi-UAVs can be efficiently performed with high success rates, and collisions are avoided.
AB - Unmanned aerial vehicles (UAVs) are expected to be an integral part of wireless networks. In this paper, we aim to find collision-free paths for multiple cellular-connected UAVs, while satisfying requirements of connectivity with ground base stations (GBSs) in the presence of a dynamic jammer. We first formulate the problem as a sequential decision making problem in discrete domain, with connectivity, collision avoidance, and kinematic constraints. We, then, propose an offline temporal difference (TD) learning algorithm with online signal-to-interference-plus-noise ratio (SINR) mapping to solve the problem. More specifically, a value network is constructed and trained offline by TD method to encode the interactions among the UAVs and between the UAVs and the environment; and an online SINR mapping deep neural network (DNN) is designed and trained by supervised learning, to encode the influence and changes due to the jammer. Numerical results show that, without any information on the jammer, the proposed algorithm can achieve performance levels close to that of the ideal scenario with the perfect SINR-map. Real-time navigation for multi-UAVs can be efficiently performed with high success rates, and collisions are avoided.
KW - Jamming resiliency
KW - collision avoidance
KW - decentralized deep reinforcement learning
KW - multi-UAV path planning
KW - wireless connectivity
UR - http://www.scopus.com/inward/record.url?scp=85112798941&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85112798941&partnerID=8YFLogxK
U2 - 10.1109/ICCWorkshops50388.2021.9473587
DO - 10.1109/ICCWorkshops50388.2021.9473587
M3 - Conference contribution
AN - SCOPUS:85112798941
T3 - 2021 IEEE International Conference on Communications Workshops, ICC Workshops 2021 - Proceedings
BT - 2021 IEEE International Conference on Communications Workshops, ICC Workshops 2021 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE International Conference on Communications Workshops, ICC Workshops 2021
Y2 - 14 June 2021 through 23 June 2021
ER -