Title A hybrid deep learning approach integrating CNN and transformer for lung cancer classification using CT scans
Authors Yousafzai, Samia Nawaz ; Nasir, Inzamam Mashood ; Mansour, Sahar ; Negm, Noha ; Alhashmi, Asma A ; Alharbi, Mohannad A ; Kim, Eunchan
DOI 10.1038/s41598-026-41161-7
Full Text Download
Is Part of Scientific reports.. Berlin : Springer Nature. 2026, Early access. ISSN 2045-2322
Keywords [eng] CT Imaging ; Convolutional Neural Network ; Deep Learning ; Hybrid Shifted Window ; Improved Swin Transformer ; Lung Cancer
Abstract [eng] Lung cancer is an extremely fatal kind of cancer, resulting in the deaths of almost 7.6 million individuals annually around the globe. Nevertheless, a timely diagnosis is a crucial necessity for enhancing the likelihood of human survival. Regarding tumor identification, CT scans are normally used to identify affected areas. Nevertheless, CT imaging face significant problems such as poor visibility of tumor locations and high false negative rates. The small dataset size of medical imaging makes it challenging to capture local lesion features by iterative training, considering all input features equally. This work integrates Convolutional Neural Network (CNN) and Improved Swin Transformer (C-Swin), a deep learning model that extracts and integrates fine-grained local and global features. C-Swin has Transformer encoder and a CNN module. The CNN module extracts local features, whereas the Transformer module captures global features. The Transformer encoder uses a hybrid shifted window attention method to focus on a spatial region of the CT image, reducing background semantic information and improving local feature capture accuracy. The proposed method is validated using the publicly accessible Kaggle dataset namely IQ-OTH/NCCD with three classes. the proposed C-Swin model achieved average accuracy of 96.26%, precision of 97.48%, recall of 96.39% and F1-score of 97.42%. The numerical findings unequivocally demonstrate that our proposed method surpasses various existing methods with an increase in accuracy ranging from 2.31% to 6.81%. The C-Swin model is capable of extracting detailed local lesion features, resulting in improved classification performance.
Published Berlin : Springer Nature
Type Journal article
Language English
Publication date 2026
CC license CC license description