Journal of Systems Engineering and Electronics ›› 2023, Vol. 34 ›› Issue (3): 682-695.doi: 10.23919/JSEE.2023.000028

• ELECTRONICS TECHNOLOGY • Previous Articles    

Dual-stream coupling network with wavelet transform for cross-resolution person re-identification

Rui SUN1,2(), Zi YANG1,2,*(), Zhenghui ZHAO1,2(), Xudong ZHANG1,2()   

  1. 1 Key Laboratory of Knowledge Engineering with Big Data (Ministry of Education), Hefei University of Technology, Hefei 230601, China
    2 School of Computer and Information, Hefei University of Technology, Hefei 230601, China
  • Received:2021-07-20 Online:2023-06-15 Published:2023-06-30
  • Contact: Zi YANG E-mail:sunrui@hfut.edu.cn;yangzi@mail.hfut.edu.cn;904150289@qq.com;xudong@hfut.edu.cn
  • About author:
    SUN Rui was born in 1976. He received his B.S. degree from the Central South University of China, in 1998, M.S. degree from Harbin Engineering University of China, in 2000, and Ph.D. degree from Huazhong University of Science and Technology of China, in 2003. He worked as a senior software engineer with TCL Mobile Communication Company, China, from 2003 to 2005. He was a visiting scholar with the Computer Science Department, University of Missouri, Columbia, MO, USA, from 2010 to 2011. He was a postdoctoral researcher with Chery automobile Company, China, from 2012 to 2014. He is currently a professor with Hefei University of Technology, China. His research interests include object recognition and tracking, computer vision, and machine learning. E-mail: sunrui@hfut.edu.cn

    YANG Zi was born in 1997. She received her B.S. degree from West Anhui University, in 2015. She is currently pursuing her M.S. degree with Hefei University of Technology. Her research interests include object recognition and tracking, computer vision, and image processing. E-mail: yangzi@mail.hfut.edu.cn

    ZHAO Zhenghui was born in 1994. She received her B.S. degree from Hefei University of Technology, in 2015. She is currently pursuing her M.S. degree with Hefei University of Technology. Her research interests include object recognition and tracking, computer vision, and visible-infrared image processing. E-mail: 904150289@qq.com

    ZHANG Xudong was born in 1966. He received his B.S. degree from Hefei University of Technology in 1989, M.S. degree from Hefei University of Technology in 1992, and Ph.D. degree from University of Science and Technology of China in 2005. He spent three months in collaborative research at Heilbronn University in Germany in 2006. Currently, he is a professor at Hefei University of Technology, China. His main research interests include image processing, pattern recognition, and intelligent information processing. E-mail: xudong@hfut.edu.cn
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61471154;61876057), and the Key Research and Development Program of Anhui Province-Special Project of Strengthening Science and Technology Police (202004D07020012).

Abstract:

Person re-identification is a prevalent technology deployed on intelligent surveillance. There have been remarkable achievements in person re-identification methods based on the assumption that all person images have a sufficiently high resolution, yet such models are not applicable to the open world. In real world, the changing distance between pedestrians and the camera renders the resolution of pedestrians captured by the camera inconsistent. When low-resolution (LR) images in the query set are matched with high-resolution (HR) images in the gallery set, it degrades the performance of the pedestrian matching task due to the absent pedestrian critical information in LR images. To address the above issues, we present a dual-stream coupling network with wavelet transform (DSCWT) for the cross-resolution person re-identification task. Firstly, we use the multi-resolution analysis principle of wavelet transform to separately process the low-frequency and high-frequency regions of LR images, which is applied to restore the lost detail information of LR images. Then, we devise a residual knowledge constrained loss function that transfers knowledge between the two streams of LR images and HR images for accessing pedestrian invariant features at various resolutions. Extensive qualitative and quantitative experiments across four benchmark datasets verify the superiority of the proposed approach.

Key words: cross-resolution, feature invariant learning, person re-identification, residual knowledge transfer, wavelet transform