TY - GEN
T1 - Rebalancing Techniques for Asynchronously Distributed EEG Data to Improve Automatic Seizure Type Classification
AU - McCallan, Niamh
AU - Davidson, Scot
AU - Ng, Kok Yew
AU - Biglarbeigi, Pardis
AU - Finlay, D
AU - Lan, Boon Leong
AU - McLaughlin, James
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023/4/10
Y1 - 2023/4/10
N2 - Epilepsy, a nervous system disorder, is characterised by unprovoked, unpredictable, and recurrent seizures. To diagnose epileptic seizures, electroencephalography (EEG) is frequently used in medical settings. Effective automated detection and classification strategies are needed because visual analysis and interpretation of EEG signals consume time and call for specialised expertise. The main objective of this paper is to examine the effectiveness of multiple rebalancing techniques to address the problem of asynchronously distributed data, specifically employing random resampling, synthetic minority oversampling technique (SMOTE), and adaptive synthetic sampling approach for imbalanced learning (ADASYN), for seizure type classification. The model utilises both frequency information using variational mode decomposition (VMD), and phase information by extracting the phase locking value (PLV) across 19 common EEG channels found in the Temple University Hospital EEG Seizure Corpus (TUSZ) v1.5.2 dataset. The random subspace k-nearest neighbour (RSkNN) ensemble classifier is used for seizure type classification of five classes - complex partial seizures (CPSZ), simple partial seizures (SPSZ), absence seizures (ABSZ), tonic clonic seizures (TCSZ), and tonic seizures (TNSZ) - to determine the performance of each rebalancing techniques, with the highest accuracy and weighted F 1 score of 96.28% and 0.964, respectively using SMOTE with two nearest neighbours.
AB - Epilepsy, a nervous system disorder, is characterised by unprovoked, unpredictable, and recurrent seizures. To diagnose epileptic seizures, electroencephalography (EEG) is frequently used in medical settings. Effective automated detection and classification strategies are needed because visual analysis and interpretation of EEG signals consume time and call for specialised expertise. The main objective of this paper is to examine the effectiveness of multiple rebalancing techniques to address the problem of asynchronously distributed data, specifically employing random resampling, synthetic minority oversampling technique (SMOTE), and adaptive synthetic sampling approach for imbalanced learning (ADASYN), for seizure type classification. The model utilises both frequency information using variational mode decomposition (VMD), and phase information by extracting the phase locking value (PLV) across 19 common EEG channels found in the Temple University Hospital EEG Seizure Corpus (TUSZ) v1.5.2 dataset. The random subspace k-nearest neighbour (RSkNN) ensemble classifier is used for seizure type classification of five classes - complex partial seizures (CPSZ), simple partial seizures (SPSZ), absence seizures (ABSZ), tonic clonic seizures (TCSZ), and tonic seizures (TNSZ) - to determine the performance of each rebalancing techniques, with the highest accuracy and weighted F 1 score of 96.28% and 0.964, respectively using SMOTE with two nearest neighbours.
KW - Electroencephalography
KW - Epileptic seizure
KW - Classification
KW - Machine learning
KW - ADASYN
KW - SMOTE
UR - https://pure.ulster.ac.uk/en/publications/22640002-be7e-4242-935c-61213e551ffc
UR - http://www.scopus.com/inward/record.url?scp=85154050167&partnerID=8YFLogxK
U2 - 10.1109/CISS56502.2023.10089669
DO - 10.1109/CISS56502.2023.10089669
M3 - Conference contribution
SN - 978-1-6654-5181-9
SN - 978-1-6654-5182-6
T3 - 2023 57th Annual Conference on Information Sciences and Systems (CISS)
BT - 2023 57th Annual Conference on Information Sciences and Systems (CISS)
PB - IEEE
ER -