A Lightweight Hybrid CNN–CBAM Model for Multistage Acute Lymphoblastic Leukemia Classification from Peripheral Blood Smear Images
Accurate and efficient classification of hematological malignancies from peripheral blood smear (PBS) images remains challenging due to the scarcity of annotated datasets, staining variability, and subtle morphological differences among blood cancer sub-types. To address these limitations, this study proposes an Advanced Lightweight Deep Learning (ALDL) framework for the multi-class classification of Acute Lymphoblastic Leukemia (ALL) across four clinically significant stages: Benign, Pro-B, Pre-B, and Early Pre-B. The framework integrates EfficientNetV2-S with Convolutional Block Attention Modules (CBAM) to enhance spatial and channel-wise feature refinement. At the same time, Focal Loss is employed to mitigate class imbalance by prioritizing hard-to-classify samples. A robust preprocessing pipeline, including CLAHE contrast enhancement, Reinhard stain normalization, and data augmentation, improves feature visibility and dataset generalization. Lesion segmentation is performed using RGB-based threshold-ing and watershed overlay, followed by lesion-level cropping to ensure consistency across inputs. Experimental evaluations on the ALL-DB dataset demonstrate the supe-rior performance of the proposed method, achieving average accuracy of 96.11%, F1-score of 95.99%, and AUC of 0.9875. Comparative analyses against MobileNetV3, ResNet50, DenseNet121, VGG16, and InceptionV3 confirm that the hybrid approach consistently outperforms conventional CNN architectures across both 70:30 and 60:40 train–test splits. Furthermore, a detailed investigation of color spaces (RGB, HSV, LAB, HED) indicates that RGB yields the most reliable segmentation and classification re-sults. At the same time, HED enhances lesion visualization at the expense of higher computational cost. The proposed ALDL framework demonstrates strong potential for real-world application as a computer-aided diagnostic (CAD) system for early leukemia detection, offering improved diagnostic reliability, reduced error rates, and practical scalability for clinical environments.