Abstract
Facial emotion recognition remains a challenging task due to the complexity of human expression and class imbalance in real-world datasets. This paper proposes a Bagging Ensemble framework for emotion classification using the manually annotated subset of the AffectNet dataset. The ensemble integrates three diverse Convolutional Neural Network (CNN) architectures, ResNet50, EfficientNet-B0, and MobileNetV2, each trained on a unique bootstrapped sample of the AffectNet training set. By leveraging both data resampling and architectural diversity, the ensemble aims to reduce variance and improve generalisation. Experiments demonstrate that the proposed ensemble achieves superior classification performance compared to individual base models, particularly in terms of robustness and Top-3 accuracy, demonstrating the effectiveness of Bagging for large-scale, imbalanced FER in unconstrained settings. This work also provides high performance benchmark using only facial cues, without augmentation or handcrafted features
| Original language | English |
|---|---|
| Title of host publication | Irish Machine Vision and Image Processing Conference 2025 |
| Pages | 112 - 119 |
| Number of pages | 8 |
| ISBN (Electronic) | 978-0-9934207-9-5 |
| Publication status | Published online - 1 Sept 2025 |
| Event | IMVIP 2025 - Ulster University, Derry~Londonderry, Northern Ireland, Londonderry, United Kingdom Duration: 1 Sept 2025 → 3 Sept 2025 https://imvipconference.github.io/ |
Conference
| Conference | IMVIP 2025 |
|---|---|
| Country/Territory | United Kingdom |
| City | Londonderry |
| Period | 1/09/25 → 3/09/25 |
| Internet address |
Bibliographical note
Paper accepted and presented as a poster at IMVIP 2025Keywords
- Emotion Recognition
- Ensemble Learning
- Machine Learning
- Computer Vision
- Bagging ensembles