AI Assisted Diagnosis of Neonatal Ear Deformities Using ResNet50 CBAM and CNN Transformer Models

Research output: Contribution to journalArticlepeer-review

Abstract

Early detection of congenital auricular deformity in infants is essential for early medical intervention, as late diagnosis would have long term psychosocial impacts and facial asymmetry. This study introduces two deep learning based architectures for the automated classification of auricular deformities: a hybrid Convolutional Neural Network Transformer (CNNT) and a new ResNet50 with a Convolutional Block Attention Module (ResNet50 CBAM). The novel ResNet50 CBAM architecture efficiently facilitates spatial and channel wise feature learning using attention mechanisms, allowing the model to focus on deformity specific regions while eliminating irrelevant background noise. Comprehensive experiments were conducted on BabyEar4k, an expert annotated, publicly released dataset containing 3,852 high resolution ear images across five clinically relevant classes of deformities. To substantiate the effectiveness of the proposed approach, a comparison was drawn with the current state-of-the-art scheme proposed by Liu Jie Ren et al., showing significant strides in classification performance. The ResNet50 CBAM model achieved a classification accuracy of 96.25%, an F1 score of 0.9576, and an AUC-ROC of 0.984, outperforming the CNNT model, which reached an accuracy of 93.97% and an F1 score of 0.9395, as well as the benchmark baseline F1 score of 0.832. These results highlight the robustness of the model and its generalization capability across classes of deformity. The proposed framework addresses some crucial shortcomings of previous works, including reliance on binary classification, insufficient subtype granularity, and inadequate preprocessing. Employing BM3D-based denoising and SMOTE-based class balancing enhances the quality and learning efficiency. In all, the methodology presents a straightforward, accurate, and scalable approach to AI-based neonatal ear deformity screening, offering substantial potential for wide integration into clinical decision-support systems to facilitate early intervention, decrease professional workload, and improve overall pediatric outcomes.
Original languageEnglish
Pages (from-to)1-15
Number of pages15
JournalIEEE Access
DOIs
Publication statusPublished (in print/issue) - 18 Feb 2026

Bibliographical note

Publisher Copyright:
© 2013 IEEE.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being
  2. SDG 15 - Life on Land
    SDG 15 Life on Land

Keywords

  • Ear
  • , Residual neural networks
  • Medical diagnostic imaging
  • Convolutional neural networks
  • Healthcare artificial intelligent (HAI)
  • Block Matching 3D Filtering (BM3D)
  • ResNet50 model Convolutional Block Attention Module (CBAM)
  • Convolutional Neural Network–Transformer hybrid (CNNT)
  • Synthetic Minority Over sampling Technique (SMOTE)

Fingerprint

Dive into the research topics of 'AI Assisted Diagnosis of Neonatal Ear Deformities Using ResNet50 CBAM and CNN Transformer Models'. Together they form a unique fingerprint.

Cite this