Investigation of Voice Pathology Detection and Classification on Different Frequency Regions Using Correlation Functions

Ahmed Al-nasheri, Ghulam Muhammad, Mansour Alsulaiman, Zulfiqar Ali

Research output: Contribution to journalArticlepeer-review

88 Citations (Scopus)
636 Downloads (Pure)

Abstract

Objectives and Background 

Automatic voice pathology detection and classification systems effectively contribute to the assessment of voice disorders, which helps clinicians to detect the existence of any voice pathologies and the type of pathology from which patients suffer in the early stages. This work concentrates on developing an accurate and robust feature extraction for detecting and classifying voice pathologies by investigating different frequency bands using correlation functions. In this paper, we extracted maximum peak values and their corresponding lag values from each frame of a voiced signal by using correlation functions as features to detect and classify pathological samples. These features are investigated in different frequency bands to see the contribution of each band on the detection and classification processes. 


Material and Methods 

Various samples of sustained vowel /a/ of normal and pathological voices were extracted from three different databases: English, German, and Arabic. A support vector machine was used as a classifier. We also performed a t-test to investigate the significant differences in mean of normal and pathological samples. 

Results 

The best achieved accuracies in both detection and classification were varied depending on the band, the correlation function, and the database. The most contributive bands in both detection and classification were between 1000 and 8000 Hz. In detection, the highest acquired accuracies when using cross-correlation were 99.809%, 90.979%, and 91.168% in the Massachusetts Eye and Ear Infirmary, Saarbruecken Voice Database, and Arabic Voice Pathology Database databases, respectively. However, in classification, the highest acquired accuracies when using cross-correlation were 99.255%, 98.941%, and 95.188% in the three databases, respectively.

Original languageEnglish
Pages (from-to)3-15
Number of pages13
JournalJournal of Voice
Volume31
Issue number1
Early online date15 Mar 2016
DOIs
Publication statusPublished (in print/issue) - 1 Jan 2017

Keywords

  • Arabic Voice Pathology Database (AVPD)
  • Frequency investigation
  • Massachusetts Eye and Ear Infirmary (MEEI)
  • Saarbruecken Voice Database (SVD)
  • Voice pathology detection and classification

Fingerprint

Dive into the research topics of 'Investigation of Voice Pathology Detection and Classification on Different Frequency Regions Using Correlation Functions'. Together they form a unique fingerprint.

Cite this