Abstract
Speech Emotion Recognition (SER) refers to accurately predicting human emotions from their speech. The ability to predict emotions through speech signals is a motivating factor in achieving Human-Computer Interaction (HCI). This paper contains a comparative study of the existing research on speech emotion models. It makes use of the RAVDESS and SAVEE dataset containing audio input. The study of speech emotion recognition is made on SVM, CNN, KNN, MLP, Decision Tree, XGBoost, and Random Forest models. This paper presents a comparative analysis of the models highlighting the accuracy, F1 Score, bar plots, and loss graphs of the same. The paper also highlights the significant future areas for study in speech emotion recognition.
Original language | English |
---|---|
Title of host publication | 2023 International Conference on Computational Intelligence, Communication Technology and Networking, CICTN 2023 |
Publisher | IEEE |
Pages | 499-505 |
Number of pages | 7 |
ISBN (Electronic) | 979-8-3503-3802-7, 979-8-3503-3803-4 |
DOIs | |
Publication status | Published (in print/issue) - 7 Jun 2023 |
Keywords
- human computer interaction
- support vector machine
- Emotion Recognition
- Analytical models
- computational modeling
- Speech recogniion
- Forestry