Comparative Analysis of Speech Emotion Recognition Models and Technique

A. Agrawal, A. Jain, B. Kaur, S. Jangid, K. Kadian, V. Dwivedi, S. Garhwal

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Speech Emotion Recognition (SER) refers to accurately predicting human emotions from their speech. The ability to predict emotions through speech signals is a motivating factor in achieving Human-Computer Interaction (HCI). This paper contains a comparative study of the existing research on speech emotion models. It makes use of the RAVDESS and SAVEE dataset containing audio input. The study of speech emotion recognition is made on SVM, CNN, KNN, MLP, Decision Tree, XGBoost, and Random Forest models. This paper presents a comparative analysis of the models highlighting the accuracy, F1 Score, bar plots, and loss graphs of the same. The paper also highlights the significant future areas for study in speech emotion recognition.
Original languageEnglish
Title of host publication2023 International Conference on Computational Intelligence, Communication Technology and Networking, CICTN 2023
PublisherIEEE
Pages499-505
Number of pages7
ISBN (Electronic)979-8-3503-3802-7, 979-8-3503-3803-4
DOIs
Publication statusPublished (in print/issue) - 7 Jun 2023

Keywords

  • human computer interaction
  • support vector machine
  • Emotion Recognition
  • Analytical models
  • computational modeling
  • Speech recogniion
  • Forestry

Fingerprint

Dive into the research topics of 'Comparative Analysis of Speech Emotion Recognition Models and Technique'. Together they form a unique fingerprint.

Cite this