A NEW APPROACH FOR SPEECH EMOTION RECOGNITION USING SINGLE LAYERED CONVOLUTIONAL NEURAL NETWORK

Authors

  • Mannar Mannan. J Department of Information Science and Engineering, CMR Institute of Technology, Bengaluru, India
  • V Vinoth Kumar School of Computer Science Engineering & Information Systems (SCORE), Vellore Institute of Technology, India
  • Shivakumara Palaiahnakote School of Science, Engineering and Environment, University of Salford, United Kingdom
  • Surbhi Bhatia Khan Department of Electrical and Computer Engineering, Lebanese American University, Byblos, Lebanon
  • Ahlam Almusharraf Department of Business Administration, College of Business and Administration, Princess Nourah bint Abdulrahman University, Saudi Arabia

DOI:

https://doi.org/10.22452/mjcs.vol37no1.6

Keywords:

Analysis of variance; Speech emotion recognition; Deep learning; CNN; Cosine-similarity measurement.

Abstract

Creating a computational system that identifies human emotions through voice analysis represents a notable achievement in the field of human-computer interaction, especially within the healthcare domain. We propose a new lightweight model that addresses the challenges of emotion recognition. The model is based on a CNN with modified kernel processing. It performs direct matching to recognize speech emotions in eight different categories, using a statistical model, Analysis of Variance (ANOVA), as the kernel for feature extraction and Cosine Similarity Measurement (CSM) as the activation function of the CNN. The proposed model contains an eight-fold, single-layer set of intermediate neurons, and each neuron uses CSM to segregate a speech emotion pattern from the voice convergence matrix, exploring one part of the whole solution. Experimental results demonstrate that the proposed model outperforms existing multi-layer CNN methods in identifying the emotional state of a speaker.
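
The two statistical building blocks named in the abstract, a one-way ANOVA F-statistic and cosine similarity, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy feature vectors, the two emotion labels, and the idea of matching a feature vector against one stored template per emotion are assumptions made purely for illustration.

```python
import math


def anova_f(groups):
    """One-way ANOVA F-statistic for a list of numeric groups.

    Assumes at least two groups and non-zero within-group variance.
    """
    k = len(groups)                      # number of groups
    n = sum(len(g) for g in groups)      # total number of samples
    grand_mean = sum(sum(g) for g in groups) / n
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_a * norm_b)


def match_emotion(features, templates):
    """Return the label whose template is most similar to `features`.

    `templates` is a hypothetical mapping from emotion label to a
    reference feature vector; the paper's actual matching may differ.
    """
    scores = {label: cosine_similarity(features, t) for label, t in templates.items()}
    best = max(scores, key=scores.get)
    return best, scores


if __name__ == "__main__":
    # Toy data only: three groups of scalar measurements.
    print(anova_f([[1, 2, 3], [2, 3, 4], [5, 6, 7]]))
    # Toy templates for two hypothetical emotion labels.
    templates = {"label_a": [1.0, 0.0, 0.0], "label_b": [0.0, 1.0, 0.0]}
    print(match_emotion([0.9, 0.1, 0.0], templates)[0])
```

In this sketch, direct matching reduces to picking the template with the highest cosine similarity; how the paper combines the ANOVA kernel with the eight neurons is not specified in the abstract and is not reproduced here.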

Published

2024-01-31

How to Cite

J, M. M., Kumar, V. V., Palaiahnakote, S., Khan, S. B., & Almusharraf, A. (2024). A NEW APPROACH FOR SPEECH EMOTION RECOGNITION USING SINGLE LAYERED CONVOLUTIONAL NEURAL NETWORK. Malaysian Journal of Computer Science, 37(1), 89–106. https://doi.org/10.22452/mjcs.vol37no1.6