Journal of Systems Engineering and Electronics

   

Equilibrium learning for multi-stage cyber-physical multi-domain security game in island air defense

Weilin YUAN1, Shaofei CHEN2, Lina LU2, Zhenzhen HU2, Yu XIE2, Jing CHEN2   

  1. 1. College of Information and Communication, National University of Defense Technology Wuhan 430014, China;
    2. College of Intelligence Science and Technology, National University of Defense Technology Changsha 410073, China
  • Received:2023-07-06
  • Contact: CHEN Shaofei E-mail:yuanweilin12@nudt.edu.cn;chensf005@163.com;lulina16@nudt.edu.cn;hzzmail@163.com;xieyu_nudt@139.com;Chenjing001@vip.sina.com
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (92271108; 61702528; 61806212; 62173336).

Abstract: Multi-domain competition is developing for disintegrating the component of the opponent’s operational system and winning advantage in decision space. Island air defense is a typical multi-domain security problem, which dramatically increases the complexity of decision-making by considering different factors such as multi-stages decisions, multi-domain settings, imperfection information, and uncertain events. However, current research on island air defense security problems is sparse and lacks consideration of key factors. To provide support for assisting human commanders to take wise decisions in a complex environment, we build a multi-domain multi-state island air defense model and propose responding solving algorithms. We study the whole progress of island air defense and propose a multi-domain, multi-stage imperfection information security game that formulates critical characters in the adversarial scenario of island air defense. In addition, considering a bounded rational opponent’s possible strategies, we propose an opponent-aware Monte Carlo counterfactual regret minimization algorithm for learning a robust defensive strategy in the security game. We evaluate our methods in various adversarial scenarios. The results show that our equilibrium learning method can effectively play against an opponent with bounded rationality and significantly outperform some advanced algorithms.

Key words: island air defense, counterfactual regret minimization, Nash equilibrium, security game, cyber-physical system