KSU rich Arabic speech database

Mansour Alsulaiman, Ghulam Muhammad, Mohamed A. Bencherif, Awais Mahmood, Zulflqar Ali

Research output: Contribution to journalArticle

13 Citations (Scopus)
27 Downloads (Pure)

Abstract

Arabic is one of the major languages in the world. Unfortunately not so much research in Arabic speaker recognition has been done. One main reason for this lack of research is the unavailability of rich Arabic speech databases. In this paper, we present a rich and comprehensive Arabic speech database that we developed for the Arabic speaker/speech recognition research and/or applications. The database is rich in different aspects: (a) it has 257 speakers; (b) the speakers are from different ethnic groups: Saudis, Arabs, and non-Arabs; (c) utterances are both read text and spontaneous; (d) scripts are of different dimensions, such as, isolated words, digits, phonetically rich words, sentences, phonetically balanced sentences, paragraphs, etc.; (e) different sets of microphones with medium and high quality; (f) telephony and non-telephony speech; (g) three different recording environments: office, sound proof room, and cafeteria; (h) three diiferent sessions, where the recording sessions are scheduled at least with 2 weeks interval. Because of the richness of this database, it can be used in many Arabic, and non-Arabic, speech processing researches, such as speaker/speech recognition, speech analysis, accent identification, ethnic groups/nationality recognition, etc. The richness of the database makes it a valuable resource for research in Arabic speech processing in particular and for research in speech processing in general. The database was carefully manually verified. The manual verification was complemented with automatic verification. Validation was performed on a subset of the database where the recognition rate reached 100% for Saudi speakers and 96% for non-Saudi speakers by using a system with 12 Mel frequency Cepstral coefficients, and 32 Gaussian mixtures.

Original languageEnglish
Pages (from-to)4231-4253
Number of pages23
JournalInformation (Japan)
Volume16
Issue number6 B
Publication statusPublished - Jun 2013

Keywords

  • Arabic speech database
  • Phonetically
  • Rich database
  • Speaker recognition
  • Speech corpus

Fingerprint Dive into the research topics of 'KSU rich Arabic speech database'. Together they form a unique fingerprint.

  • Cite this

    Alsulaiman, M., Muhammad, G., Bencherif, M. A., Mahmood, A., & Ali, Z. (2013). KSU rich Arabic speech database. Information (Japan), 16(6 B), 4231-4253.