Journal of Systems Engineering and Electronics ›› 2026, Vol. 37 ›› Issue (2): 567-578.doi: 10.23919/JSEE.2024.000006

• SYSTEMS ENGINEERING • Previous Articles    

Equilibrium learning for multi-stage cyber-physical multi-domain security game in island air defense

Weilin YUAN1(), Shaofei CHEN2,*(), Lina LU2(), Zhenzhen HU2(), Yu XIE2(), Jing CHEN2()   

  1. 1College of Information and Communication, National University of Defense Technology Wuhan 430014, China
    2College of Intelligence Science and Technology, National University of Defense Technology Changsha 410073, China
  • Received:2023-07-06 Online:2026-04-18 Published:2026-04-30
  • Contact: Shaofei CHEN E-mail:yuanweilin12@nudt.edu.cn;chensf005@163.com;lulina16@nudt.edu.cn;hzzmail@163.com;xieyu_nudt@139.com;Chenjing001@vip.sina.com
  • About author:
    YUAN Weilin was born in 1994. He received his B.S., M.S., and Ph.D. degrees in control science and engineering from National University of Defense Technology, Changsha, China, in 2012, 2016 and 2023 respectively. He is a lecturer in National University of Defense Technology. His research interests include cognitive decision-making, intelligent gaming, opponent modeling, reinforcement learning, and multi-agent system. E-mail: yuanweilin12@nudt.edu.cn

    CHEN Shaofei was born in 1987. He received his B.S. degree from Harbin Institute of Technology, Harbin, China, in 2009, M.S. and Ph.D. degrees in control science and engineering from National University of Defense Technology, Changsha, in 2011 and 2016 respectively. Since 2019, he has been an associate professor with the College of Intelligence Science and Technology, National University of Defense Technology. His research interests include artificial intelligence, multiagent system, and reinforcement learning. E-mail: chensf005@163.com

    LU Lina was born in 1984. She received her Ph.D. degree in control science and engineering from National University of Defense Technology, Changsha, China, in 2020. Since 2020, she has been a lecturer with the College of Intelligence Science and Technology, National University of Defense Technology. Her current research interests include reinforcement learning, opponent modeling, and complex network. E-mail: lulina16@nudt.edu.cn

    HU Zhenzhen was born in 1984. He received his B.S. degree in thermal energy and power engineering from Shanghai Jiaotong University in 2006 and M.S. degree in fluid mechanics at China Aerodynamics Research and Development Center in 2009. He is currently pursuing his Ph.D. degree in control science and engineering at National University of Defense Technology, Changsha, China. His current research interests include artificial intelligence, opponent modeling, and game theory. E-mail: hzzmail@163.com

    XIE Yu was born in 1982. He received his M.S. and Ph.D. degrees in aeronautical and astronautical science and technology from National University of Defense Technology, Changsha, in 2007 and 2012. Since 2015, he has been an associate professor with the College of Intelligence Science and Technology, National University of Defense Technology. His research interests include intelligent decision-making and planning. E-mail: xieyu_nudt@139.com

    CHEN Jing was born in 1972. He received his M.S. and Ph.D. degrees in control science and engineering from National University of Defense Technology, Changsha, China, in 1993 and 1999, respectively. He is a full professor with the Department of Intelligence Science and Technology, College of Intelligence Science and Technology, National University of Defense Technology. His current research interests include artificial intelligence, intelligence control, and unmanned vehicle mission planning. E-mail: Chenjing001@vip.sina.com
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (92271108; 61702528; 61806212; 62173336).

Abstract:

Multi-domain competition is developing for disintegrating the component of the opponent’s operational system and winning advantage in decision space. Island air defense is a typical multi-domain security problem, which dramatically increases the complexity of decision-making by considering different factors such as multi-stages decisions, multi-domain settings, imperfection information, and uncertain events. However, current research on island air defense security problems is sparse and lacks consideration of key factors. To provide support for assisting human commanders to take wise decisions in a complex environment, we build a multi-domain multi-state island air defense model and propose responding solving algorithms. We study the whole progress of island air defense and propose a multi-domain, multi-stage imperfection information security game that formulates critical characters in the adversarial scenario of island air defense. In addition, considering a bounded rational opponent’s possible strategies, we propose an opponent-aware Monte Carlo counterfactual regret minimization algorithm for learning a robust defensive strategy in the security game. We evaluate our methods in various adversarial scenarios. The results show that our equilibrium learning method can effectively play against an opponent with bounded rationality and significantly outperform some advanced algorithms.

Key words: island air defense, counterfactual regret minimization, Nash equilibrium, security game, cyber-physical system