A Novel Transformer-Enhanced Framework for Multi-scale Feature Fusion in Remote Sensing Change Detection
DOI:
https://doi.org/10.62051/0f4peg35Keywords:
Remote sensing image change detection; Multi-scale feature fusion; Enhanced Transformer; Attention mechanism; Graph convolutional network.Abstract
With the rapid advancement of remote sensing technology, high-resolution satellite imagery has become increasingly accessible for monitoring urban expansion and surface changes. However, change detection in remote sensing images faces several significant challenges, including inconsistent image registration, inadequate handling of multi-scale features, class imbalance in training samples, and the occurrence of pseudo-changes that appear as modifications but represent no actual surface alterations. To address these limitations, this paper presents MA-CDNeXt, a novel deep learning model that enhances the Transformer architecture for improved change detection in high-resolution remote sensing imagery. The proposed model incorporates three key innovations: (1) an enhanced feature encoder that better captures spatial-temporal information, (2) integration of MAFGNet's multi-scale attention fusion mechanism within CDNeXt's spatial-temporal module to effectively process features at different scales, and (3) an optimized feature fusion decoder equipped with a dual-branch graph convolution module for more accurate change localization. To mitigate the class imbalance problem commonly encountered in change detection datasets, we employ a combined loss function incorporating cross-entropy loss, Dice loss, and focal loss. Experimental evaluation on the LEVIR-CD+ dataset demonstrates the effectiveness of our approach, achieving an overall accuracy of 84.09% and a recall rate of 78.39%. The results indicate that MA-CDNeXt successfully addresses the key challenges in remote sensing image change detection and provides a robust solution for monitoring surface changes in high-resolution imagery.
Downloads
References
[1] REN W, WANG Z, XIA M, et al. MFINet: Multi-scale feature interaction network for change detection of high-resolution remote sensing images[J]. Remote Sensing, 2024, 16(7): 1269.
[2] WANG H, YE Z, XU C, et al. TTMGNet: Tree topology mamba-guided network collaborative hierarchical incremental aggregation for change detection[J]. Remote Sensing, 2024, 16(21): 4068.
[3] SODNOM E, DAMDINSUREN A. Improvement of Mongolian Height System Using a Satellite Technology[J]. Open Journal of Applied Sciences, 2020, 10(4): 154-168.
[4] CAI L, YIN J, YAN X, et al. The environmental analysis and site selection of mussel and large yellow croaker aquaculture areas based on high resolution remote sensing[J]. Acta Oceanologica Sinica, 2024, 43(3): 66-86.
[5] LIU J, GU H, LIU F, et al. CE-CDNet: A Transformer-Based Channel Optimization Approach for Change Detection in Remote Sensing[J]. Computers, Materials & Continua, 2025, 83(4): 803-822.
[6] YAN W, CAO L, YAN P, et al. Remote sensing image change detection based on swin transformer and cross-attention mechanism[J]. Earth Science Informatics, 2025, 18(1): 106.
[7] RAO M S, RAO G R, KRISHNA G M, et al. Revolutionizing Groundwater Suitability with AI-Driven Spatial Decision Support—A Remote Sensing and GIS Approach for Visakhapatnam District, Andhra Pradesh, India[J]. Journal of Geographic Information System, 2025, 17(1): 23-44.
[8] SARR A. Multi-Scale Characteristics of Precipitation and Temperature over West Africa Using SMHI-RCA Driven by GCMs under RCP8.5[J]. American Journal of Climate Change, 2017, 6(3): 455-486.
[9] LI Y M, ZHAN R F, DING Y H. Multi-Scale Influencing Factors and Prediction of Interannual Variability in Rapid Intensification Magnitude of Northwest Pacific Tropical Cyclones[J]. Journal of Tropical Meteorology, 2025, 31(1): 75-86.
[10] LI Z, CAO S, DENG J, et al. STADE-CDNet: Spatial-temporal attention with difference enhancement-based network for remote sensing image change detection[J]. IEEE Transactions on Geoscience and Remote Sensing, 2024, 62(1): 1-15.
[11] WANG Z, GU G, XIA M, et al. Bitemporal attention sharing network for remote sensing image change detection[J]. IEEE Transactions on Geoscience and Remote Sensing, 2024, 62(1): 1-15.
[12] ACHOUR S, ELMEZOUARI M C, TALEB N, et al. A PCA-PD fusion method for change detection in remote sensing multi temporal images[J]. International Journal of Remote Sensing, 2019, 40(15): 2-15.
[13] DALMIYA C P, SANTHI N, SATHYABAMA B. A novel feature descriptor for automatic change detection in remote sensing images[J]. Egyptian Journal of Remote Sensing and Space Science, 2018, 21(1): 1-10.
[14] FAN C L, CHUNG Y J. Integrating Image Processing Technology and Deep Learning to Identify Crops in UAV Orthoimages[J]. Computers, Materials & Continua, 2025, 82(2): 1925-1945.
[15] SAIDI S, IDBRAIM S, KARMOUDE Y, et al. Deep-learning for change detection using multi-modal fusion of remote sensing images: A review[J]. Remote Sensing, 2022, 14(10): 2343.
[16] NOMAN M, FIAZ M, CHOLAKKAL H, et al. ELGC-Net: Efficient local-global context aggregation for remote sensing change detection[J]. IEEE Transactions on Geoscience and Remote Sensing, 2024, 62(1): 1-10.
[17] ZOU C, LIANG W, LIU L, et al. Hyperspectral image change detection based on an improved multi-scale and spectral-wise transformer[J]. International Journal of Remote Sensing, 2024, 45(6): 1903-1924.
[18] XU L, LI F, CHANG S. Unifying Convolution and Transformer Decoder for Textile Fiber Identification[J]. Journal of Donghua University (English Edition), 2023, 40(4): 357-363.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Transactions on Environment, Energy and Earth Sciences

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.









