| Title |
A novel multi-scale feature fusion with adaptive scale-space pyramid network for aerial scene recognition using remote sensing images |
| Authors |
Abbas, Muhammad John ; Khan, Muhammad Attique ; Ahmed, Waqas ; Hamza, Ameer ; Hadj-Alouane, Nejib Ben ; Alsenan, Shrooq ; Alouane, M. Turki-Hadj ; Nam, Yunyoung |
| DOI |
10.1109/JSTARS.2026.3651659 |
| Full Text |
|
| Is Part of |
IEEE Journal of selected topics in applied Earth observations and remote sensing.. Piscataway, NJ : IEEE. 2026, vol. 19, p. 4337-4357.. ISSN 1939-1404. eISSN 2151-1535 |
| Keywords [eng] |
Remote sensing ; Image classification ; Adaptive Scale-Space Pyramid Network (ASSPN) ; Complexity-aware pooling ; Multi-scale feature fusion ; Explainability ; Gaussian pyramid ; Attention Mechanisms |
| Abstract [eng] |
Remote Sensing is an area anthropogenic study undertaken worldwide. It has succeeded significantly in important applications such as climate monitoring, disaster prediction and land use planning. However, due to the diversity of scales, intra-class similarities, and complex scenes, the accurate recognition process remains challenging. Transformers' global attention mechanism helps them to overcome the limitations of CNNs' local receptive fields; however, they have drawback of increased computing complexity. To overcome such challenges, this work proposes an Adaptive Scale-Space Pyramid Network (ASSPN) for improved remote sensing image classification. The ASSPN architecture contains a learnable Gaussian pyramid module for multi-scale feature representation, a scale selection attention mechanism for dynamically weighing feature relevance, a cross-feature propagation module for fusion guided by uncertainty, and a complexity-aware adaptive pooling module for preserving semantic discriminative features. Experiments are performed three benchmark datasets such as EuroSAT, NWPU-RESISC-45, and MLRSNet. On these datasets, the ASSPN achieves state-of-the-art results with accuracies of 96.14%, 94.73%, and 95.42%, respectively. The obtained accuracy is outperforming previous CNN and transformer-based systems with significant margins. Furthermore, ASSPN is noise perturbation-resistant and shows generalization capability across a wide range of land-cover categories. Ablation studies established the complementary benefits of the core modules, while LIME-based explainability analysis confirmed the predicative trustworthiness of the model. |
| Published |
Piscataway, NJ : IEEE |
| Type |
Journal article |
| Language |
English |
| Publication date |
2026 |
| CC license |
|