Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms

Tamer A. Mesallam, Mohamed Farahat, Khalid H. Malki, Mansour Alsulaiman, Zulfiqar Ali, Ahmed Al-Nasheri, Ghulam Muhammad

Research output: Contribution to journalArticle

9 Citations (Scopus)

Abstract

A voice disorder database is an essential element in doing research on automatic voice disorder detection and classification. Ethnicity affects the voice characteristics of a person, and so it is necessary to develop a database by collecting the voice samples of the targeted ethnic group. This will enhance the chances of arriving at a global solution for the accurate and reliable diagnosis of voice disorders by understanding the characteristics of a local group. Motivated by such idea, an Arabic voice pathology database (AVPD) is designed and developed in this study by recording three vowels, running speech, and isolated words. For each recorded samples, the perceptual severity is also provided which is a unique aspect of the AVPD. During the development of the AVPD, the shortcomings of different voice disorder databases were identified so that they could be avoided in the AVPD. In addition, the AVPD is evaluated by using six different types of speech features and four types of machine learning algorithms. The results of detection and classification of voice disorders obtained with the sustained vowel and the running speech are also compared with the results of an English-language disorder database, the Massachusetts Eye and Ear Infirmary (MEEI) database.

LanguageEnglish
Article number8783751
Pages1-13
Number of pages13
JournalJournal of Healthcare Engineering
Volume2017
DOIs
Publication statusPublished - 19 Oct 2017

Fingerprint

Pathology
Learning algorithms
Learning systems
Databases
Voice Disorders
Machine Learning
Language Disorders
Ethnic Groups
Ear

Cite this

Mesallam, Tamer A. ; Farahat, Mohamed ; Malki, Khalid H. ; Alsulaiman, Mansour ; Ali, Zulfiqar ; Al-Nasheri, Ahmed ; Muhammad, Ghulam. / Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms. In: Journal of Healthcare Engineering. 2017 ; Vol. 2017. pp. 1-13.
@article{cadfba846ef94343a13ae46a2b9f3911,
title = "Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms",
abstract = "A voice disorder database is an essential element in doing research on automatic voice disorder detection and classification. Ethnicity affects the voice characteristics of a person, and so it is necessary to develop a database by collecting the voice samples of the targeted ethnic group. This will enhance the chances of arriving at a global solution for the accurate and reliable diagnosis of voice disorders by understanding the characteristics of a local group. Motivated by such idea, an Arabic voice pathology database (AVPD) is designed and developed in this study by recording three vowels, running speech, and isolated words. For each recorded samples, the perceptual severity is also provided which is a unique aspect of the AVPD. During the development of the AVPD, the shortcomings of different voice disorder databases were identified so that they could be avoided in the AVPD. In addition, the AVPD is evaluated by using six different types of speech features and four types of machine learning algorithms. The results of detection and classification of voice disorders obtained with the sustained vowel and the running speech are also compared with the results of an English-language disorder database, the Massachusetts Eye and Ear Infirmary (MEEI) database.",
author = "Mesallam, {Tamer A.} and Mohamed Farahat and Malki, {Khalid H.} and Mansour Alsulaiman and Zulfiqar Ali and Ahmed Al-Nasheri and Ghulam Muhammad",
year = "2017",
month = "10",
day = "19",
doi = "10.1155/2017/8783751",
language = "English",
volume = "2017",
pages = "1--13",
journal = "Journal of Healthcare Engineering",
issn = "2040-2295",

}

Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms. / Mesallam, Tamer A.; Farahat, Mohamed; Malki, Khalid H.; Alsulaiman, Mansour; Ali, Zulfiqar; Al-Nasheri, Ahmed; Muhammad, Ghulam.

In: Journal of Healthcare Engineering, Vol. 2017, 8783751, 19.10.2017, p. 1-13.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms

AU - Mesallam, Tamer A.

AU - Farahat, Mohamed

AU - Malki, Khalid H.

AU - Alsulaiman, Mansour

AU - Ali, Zulfiqar

AU - Al-Nasheri, Ahmed

AU - Muhammad, Ghulam

PY - 2017/10/19

Y1 - 2017/10/19

N2 - A voice disorder database is an essential element in doing research on automatic voice disorder detection and classification. Ethnicity affects the voice characteristics of a person, and so it is necessary to develop a database by collecting the voice samples of the targeted ethnic group. This will enhance the chances of arriving at a global solution for the accurate and reliable diagnosis of voice disorders by understanding the characteristics of a local group. Motivated by such idea, an Arabic voice pathology database (AVPD) is designed and developed in this study by recording three vowels, running speech, and isolated words. For each recorded samples, the perceptual severity is also provided which is a unique aspect of the AVPD. During the development of the AVPD, the shortcomings of different voice disorder databases were identified so that they could be avoided in the AVPD. In addition, the AVPD is evaluated by using six different types of speech features and four types of machine learning algorithms. The results of detection and classification of voice disorders obtained with the sustained vowel and the running speech are also compared with the results of an English-language disorder database, the Massachusetts Eye and Ear Infirmary (MEEI) database.

AB - A voice disorder database is an essential element in doing research on automatic voice disorder detection and classification. Ethnicity affects the voice characteristics of a person, and so it is necessary to develop a database by collecting the voice samples of the targeted ethnic group. This will enhance the chances of arriving at a global solution for the accurate and reliable diagnosis of voice disorders by understanding the characteristics of a local group. Motivated by such idea, an Arabic voice pathology database (AVPD) is designed and developed in this study by recording three vowels, running speech, and isolated words. For each recorded samples, the perceptual severity is also provided which is a unique aspect of the AVPD. During the development of the AVPD, the shortcomings of different voice disorder databases were identified so that they could be avoided in the AVPD. In addition, the AVPD is evaluated by using six different types of speech features and four types of machine learning algorithms. The results of detection and classification of voice disorders obtained with the sustained vowel and the running speech are also compared with the results of an English-language disorder database, the Massachusetts Eye and Ear Infirmary (MEEI) database.

UR - http://www.scopus.com/inward/record.url?scp=85042061834&partnerID=8YFLogxK

U2 - 10.1155/2017/8783751

DO - 10.1155/2017/8783751

M3 - Article

VL - 2017

SP - 1

EP - 13

JO - Journal of Healthcare Engineering

T2 - Journal of Healthcare Engineering

JF - Journal of Healthcare Engineering

SN - 2040-2295

M1 - 8783751

ER -