Journal of Systems Engineering and Electronics ›› 2026, Vol. 37 ›› Issue (2): 357-366.doi: 10.23919/JSEE.2026.000061

• ELECTRONICS TECHNOLOGY • Previous Articles    

MMF-SLR: a sign language recognition method based on multi-modal feature using millimeter-wave radar

Chang CUI1,2,3(), Guiyan WEI2(), Xichao DONG1,2,3,*(), Cheng HU1,2,3(), Jianping WANG1,2,3()   

  1. 1Radar Technology Research Institute, Beijing Institute of Technology, Beijing 100081, China
    2Chongqing Innovation Center, Beijing Institute of Technology, Chongqing 401120, China
    3Chongqing Key Laboratory of Novel Civilian Radar, Chongqing 401120, China
  • Received:2024-07-16 Accepted:2026-03-26 Online:2026-04-18 Published:2026-04-30
  • Contact: Xichao DONG E-mail:cuichang0329@qq.com;weigy9677@163.com;xcdong@bit.edu.cn;cchchb@163.com;jianpingwang@bit.edu.cn
  • About author:
    CUI Chang was born in 1994. She received her B.S. and Ph.D. degrees from the School of Information and Electronics, Beijing Institute of Technology, in 2016 and 2022, respectively. She is currently a Post-Doctoral Researcher with the Beijing Institute of Technology Chongqing Innovation Center. Her research interests include geosynchronous synthetic aperture radar (SAR) signal processing and moving target indication. E-mail: cuichang0329@qq.com

    WEI Guiyan was born in 1996. She is pursuing her master degree in Chongqing Three Gorges University. Her research interest is cross-modal sign language recognition technology based on millimeter wave radar. E-mail: weigy9677@163.com

    DONG Xichao was born in 1986. He received his B.S. and Ph.D. degrees from the School of Information and Electronics, Beijing Institute of Technology (BIT) in 2008 and 2014, respectively. He is currently an associate professor at BIT. His research interests include radar signal processing and weather radar. E-mail: xcdong@bit.edu.cn

    HU Cheng was born in 1981. He received his B.S. and Ph.D. degrees from National University of Defense Technology and Beijing Institute of Technology respectively. Since 2009, he has been with Beijing Institute of Technology (BIT)’s School of Information and Electronics, and is now a full professor, doctoral supervisor, and Vice President of the School of Information and Electronics in BIT. His research interests include synthetic aperture radar (SAR) and entomological radar signal processing. E-mail: cchchb@163.com

    WANG Jianping was born in 1987. He received his Ph.D. degree in electrical engineering from Delft University of Technology in 2018. He is currently a full professor at the School of Information and Electronics, Beijing Institute of Technology, focusing on microwave imaging, signal processing, and antenna array design. E-mail: jianpingwang@bit.edu.cn
  • Supported by:
    This work was supported by the Postdoctoral Science Foundation of Chongqing in China (CSTB2022NSCQ-BHX0699), and the Chongqing Special Program for Technological Innovation and Application Development (CSTB2023TIAD-KPX0065).

Abstract:

Sign language recognition (SLR) can be improved by using millimeter-wave radar, which safeguards the privacy of those who are hearing-impaired. However, existing SLR methods do not fully utilize the unique features of radar echoes, resulting in limited accuracy. Sign language is composed of an individual’s poses and hand movements. To obtain these recognition features, this paper presents a multi-modal-feature-based SLR (MMF-SLR) network framework. This method first constructs a transformer-pose network to extract the human skeleton information, which represents the poses in sign language, from the radar images. Additionally, hand movement information can be represented by the range-Doppler sequence and micro-Doppler signatures. The human skeleton and hand movement information are input into a multimodal fusion network to achieve high-accuracy SLR. The experimental results demonstrate that the proposed method can enhance the recognition accuracy of the sign language with similar poses or movements compared to the traditional SLR methods.

Key words: sign language recognition, millimeter-wave radar, multimodal, pose estimation, micro-Doppler, range-Doppler