Voice Pathology Detection and Classification Using Auto-Correlation and Entropy Features in Different Frequency Regions

Ahmed Al-Nasheri, Ghulam Muhammad, Mansour Alsulaiman, Zulfiqar Ali, Khalid H. Malki, Tamer A. Mesallam, Mohamed Farahat Ibrahim

Research output: Contribution to journalArticlepeer-review

125 Citations (Scopus)
1472 Downloads (Pure)

Abstract

Automatic voice pathology detection and classification systems effectively contribute to the assessment of voice disorders, enabling the early detection of voice pathologies and the diagnosis of the type of pathology from which patients suffer. This paper concentrates on developing an accurate and robust feature extraction for detecting and classifying voice pathologies by investigating different frequency bands using autocorrelation and entropy. We extracted maximum peak values and their corresponding lag values from each frame of a voiced signal by using autocorrelation as features to detect and classify pathological samples. We also extracted the entropy for each frame of the voice signal after we normalized its values to be used as the features. These features were investigated in distinct frequency bands to assess the contribution of each band to the detection and classification processes. Various samples of the sustained vowel /a/ for both normal and pathological voices were extracted from three different databases in English, German, and Arabic. A support vector machine was used as a classifier. We also performed u-Tests to investigate if there is a significant difference between the means of the normal and pathological samples. The best achieved accuracies in both detection and classification varied depending on the used band, method, and database. The most contributive bands in both detection and classification were between 1000 and 8000 Hz. The highest obtained accuracies in the case of detection were 99.69%, 92.79%, and 99.79% for Massachusetts eye and ear infirmary (MEEI), Saarbrücken voice database (SVD), and Arabic voice pathology database (AVPD), respectively. However, the highest achieved accuracies for classification were 99.54%, 99.53%, and 96.02% for MEEI, SVD, and AVPD, correspondingly, using the combined feature.

Original languageEnglish
Pages (from-to)6961-6974
Number of pages14
JournalIEEE Access
Volume6
DOIs
Publication statusPublished (in print/issue) - 20 Apr 2017

Keywords

  • Arabic voice pathology database (AVPD)
  • frequency investigation
  • Massachusetts eye and ear infirmary (MEEI)
  • Saarbrücken voice database (SVD)
  • Voice pathology detection and classification

Fingerprint

Dive into the research topics of 'Voice Pathology Detection and Classification Using Auto-Correlation and Entropy Features in Different Frequency Regions'. Together they form a unique fingerprint.

Cite this