TY - GEN
T1 - Small Object Detection with Super-Resolution Cascaded YOLOv4: An Average Precision Analysis
AU - Javed, Sharaz
AU - Khan, Muhammad U.S.
AU - Ali Khan, Irtiza
AU - Mahmud, Maqsood
AU - Saeed, Muhammad Tariq
PY - 2026/1/13
Y1 - 2026/1/13
N2 - Object detection is a fundamental aspect of computer vision, especially for satellite image analysis. Detecting small objects remains challenging due to limited resolution and features distortion, impede deep learning models’ learning ultimately impacting detection performance. To address this challenge, this paper investigates regression-based super-resolution (SR) models to generate high-resolution satellite images, aiming to enhance small object clarity and detail, and focus on quantitatively evaluate the influence of SR on average precision (AP) of small object detection. For this, we propose a deep learning pipeline, cascades regression-based SR architectures, including Super-Resolution Convolutional Neural Network (SRCNN), Very Deep Super-Resolution (VDSR) network, and Residual Channel Attention Network (RCAN), individually with the YOLOv4 to detect the small objects, particularly the "Planes" class in the DOTA v1.5 dataset. Experimental results indicate that SRCNN3(3-layer) outperforms VDSR20 and RCAN variants (RCAN20 & RCAN100) in mean BRISQUE scores but does not improve YOLOv4 detection when cascaded, likely due to alters textures and edges in manner that is suboptimal for YOLOv4-based detection. Conversely, RCAN100 demonstrate encouraging results, potentially improving detection performance, despite lower BRISQUE scores than SRCNN3. This indicates that cascading regression-based SR does not necessarily guarantee to enhance YOLOv4-based object detection performance. The source code is available at https://github.com/SharazJaved/SR_YOLO_DOTA.
AB - Object detection is a fundamental aspect of computer vision, especially for satellite image analysis. Detecting small objects remains challenging due to limited resolution and features distortion, impede deep learning models’ learning ultimately impacting detection performance. To address this challenge, this paper investigates regression-based super-resolution (SR) models to generate high-resolution satellite images, aiming to enhance small object clarity and detail, and focus on quantitatively evaluate the influence of SR on average precision (AP) of small object detection. For this, we propose a deep learning pipeline, cascades regression-based SR architectures, including Super-Resolution Convolutional Neural Network (SRCNN), Very Deep Super-Resolution (VDSR) network, and Residual Channel Attention Network (RCAN), individually with the YOLOv4 to detect the small objects, particularly the "Planes" class in the DOTA v1.5 dataset. Experimental results indicate that SRCNN3(3-layer) outperforms VDSR20 and RCAN variants (RCAN20 & RCAN100) in mean BRISQUE scores but does not improve YOLOv4 detection when cascaded, likely due to alters textures and edges in manner that is suboptimal for YOLOv4-based detection. Conversely, RCAN100 demonstrate encouraging results, potentially improving detection performance, despite lower BRISQUE scores than SRCNN3. This indicates that cascading regression-based SR does not necessarily guarantee to enhance YOLOv4-based object detection performance. The source code is available at https://github.com/SharazJaved/SR_YOLO_DOTA.
KW - Small Object Detection
KW - Satellite Imagery
KW - DOTA v1.5, Super-Resolution (SR)
KW - YOLOv4
KW - SRCNN
KW - VDSR
KW - RCAN
KW - Average Precison (AP)
UR - https://pure.ulster.ac.uk/en/publications/e6ebaa4b-85fe-4754-82fb-ebac68507a10
U2 - 10.1109/fit67061.2025.11333743
DO - 10.1109/fit67061.2025.11333743
M3 - Conference contribution
SN - 979-8-3315-7481-9
T3 - 2025 International Conference on Frontiers of Information Technology (FIT)
SP - 1
EP - 6
BT - 2025 International Conference on Frontiers of Information Technology (FIT)
PB - IEEE
T2 - 2025 International Conference on Frontiers of Information Technology (FIT)
Y2 - 15 December 2025 through 16 December 2025
ER -