TransfusionNet: Framework for cervical cancer detection using deep learning with multi-level fusion

Md. Najib Hasan, Sourav Basak Shuvo, Md Mahadi Hasan Ankon, S.M. Taslim Uddin Raju, Nazmul Siddique

Research output: Contribution to journalArticlepeer-review

Abstract

This paper presents a hybrid deep learning model, TransfusionNet, comprising VGG19, ResNet50, and ViT with fusion for early detection of cervical cancer. An improved medical image preprocessing technique combining contrast-limited adaptive histogram equalization (CLAHE), Laplacian sharpening, bilateral filtering, and unsharp masking has been applied to enhance image quality. An early-layer feature fusion strategy was used to enhance feature diversity and improve decision-making, unlike conventional methods that fuse only at final layers. The fusion strategy combines features from VGG19, ResNet50, and ViT at multiple stages using concatenation and element-wise addition, allowing integration of detailed and high-level information across different levels. Two separate transition blocks were used to align VGG19 and ResNet50 outputs at different layers. TransfusionNet is trained and tested on the publicly available SIPaKMeD dataset, containing 4,049 images of five classes. TransfusionNet is also verified on Herlev and LCPSI datasets. The five-fold cross-validation was employed to mitigate overfitting and enhance model's robustness and generalizability. TransfusionNet demonstrates promising results for early cervical cancer detection, achieving state-of-the-art performance on the SIPaKMeD dataset. For batch size 128, TransfusionNet achieved an average accuracy of 99.40%, an F1-score of 99.36%, sensitivity of 99.25%, and Cohen's Kappa of 99.67% across the five folds. On the SIPaKMeD dataset, the proposed model outperformed the best-performing model by 3.77% among the cited models. On the Herlev and LCPSI datasets, TransfusionNet surpassed the top-performing methods by 6.50% and 1.22% in accuracy, respectively. Future work will focus on validating the model on larger and more diverse datasets. The source code is available at https://github.com/souravbasakshuvo/TransFusionNet.
Original languageEnglish
Article number107174
Pages (from-to)1-21
Number of pages21
JournalResults in Engineering
Volume28
Early online date19 Sept 2025
DOIs
Publication statusPublished (in print/issue) - 1 Dec 2025

Bibliographical note

Publisher Copyright:
© 2025

Data Access Statement

Data will be made available on request.

Keywords

  • Aggregated fusion
  • Bi-fusion
  • Cervical cancer
  • Classification
  • Early detection
  • Ensemble learning

Fingerprint

Dive into the research topics of 'TransfusionNet: Framework for cervical cancer detection using deep learning with multi-level fusion'. Together they form a unique fingerprint.

Cite this