DEEP LEARNING METHOD FOR SPEECH EMOTION RECOGNITION BASED ON IMPROVED K-MEAN CLUSTERING

Li Qiaojun; Guo Guo

doi:10.3969/j.issn.1000-386x.2024.09.032

Li Qiaojun, Guo Guo. DEEP LEARNING METHOD FOR SPEECH EMOTION RECOGNITION BASED ON IMPROVED K-MEAN CLUSTERING[J]. Computer Applications and Software, 2024, 41(9): 224-229. DOI: 10.3969/j.issn.1000-386x.2024.09.032


Abstract

To address the low accuracy and high time complexity of current speech emotion recognition (SER) methods, a deep learning method for speech emotion recognition based on improved k-means clustering is proposed. The improved k-means clustering algorithm selects the key segments that reflect emotional features from the whole audio signal. The selected sequence is transformed into a spectrogram using the short-time Fourier transform. The deep residual network ResNet and a deep Bi-LSTM network then learn the emotion-related hidden features of the spectrogram in both the spatial and temporal dimensions. The final emotion classification is obtained with a Softmax classifier. Experimental results show that the proposed method has clear advantages over other recognition methods, improving the emotion recognition rate and reducing the model's processing time.
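
The front end of the pipeline described in the abstract (segment selection by clustering, followed by a short-time Fourier transform) can be illustrated with a minimal NumPy sketch. It rests on several assumptions not stated in the abstract: it uses plain Lloyd's k-means rather than the authors' improved variant, it uses per-segment magnitude spectra as the clustering features, and the helper names, segment length, `n_fft`, and `hop` values are all hypothetical. The ResNet, Bi-LSTM, and Softmax stages are omitted.

```python
import numpy as np

def split_segments(signal, seg_len):
    """Cut a 1-D signal into non-overlapping segments of seg_len samples."""
    n = len(signal) // seg_len
    return signal[: n * seg_len].reshape(n, seg_len)

def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means (NOT the paper's improved variant)."""
    rng = np.random.default_rng(seed)
    # Initialise centroids from k distinct data points (fancy indexing copies).
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(iters):
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = points[labels == j]
            if len(members):  # leave an empty cluster's centroid unchanged
                centroids[j] = members.mean(axis=0)
    return centroids

def select_key_segments(segments, k):
    """Return sorted indices of the segments nearest to each centroid."""
    # Assumed feature: magnitude spectrum of each segment.
    feats = np.abs(np.fft.rfft(segments, axis=1))
    centroids = kmeans(feats, k)
    dists = np.linalg.norm(feats[:, None, :] - centroids[None, :, :], axis=2)
    return np.unique(dists.argmin(axis=0))

def stft_magnitude(x, n_fft=256, hop=128):
    """Magnitude STFT with a Hann window, shape (freq_bins, frames)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop : i * hop + n_fft] * window
                       for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames, axis=1)).T

# Synthetic stand-in for an utterance: 16000 samples of noise.
rng = np.random.default_rng(0)
signal = rng.standard_normal(16000)
segments = split_segments(signal, 1000)      # 16 segments of 1000 samples
keys = select_key_segments(segments, k=4)    # indices of key segments
selected = segments[keys].reshape(-1)        # concatenate in time order
spec = stft_magnitude(selected)              # input a CNN/RNN stage would see
```

Keeping only the segments nearest the cluster centroids shortens the sequence passed to the later stages, which is one plausible reading of how the method reduces processing time.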
