VLCA: vision-language aligning model with cross-modal attention for bilingual remote sensing image captioning
Tingting WEI, Weilin YUAN, Junren LUO, Wanpeng ZHANG, Lina LU
Journal of Systems Engineering and Electronics. 2023, (1): 9-18. DOI: 10.23919/JSEE.2023.000035