Li Qiaojun, Guo Guo. DEEP LEARNING METHOD FOR SPEECH EMOTION RECOGNITION BASED ON IMPROVED K-MEAN CLUSTERING[J]. Computer Applications and Software, 2024, 41(9): 224-229. DOI: 10.3969/j.issn.1000-386x.2024.09.032

DEEP LEARNING METHOD FOR SPEECH EMOTION RECOGNITION BASED ON IMPROVED K-MEAN CLUSTERING

  • To address the low accuracy and high time complexity of current speech emotion recognition (SER) methods, a deep learning method based on improved k-means clustering is proposed. The improved k-means clustering algorithm selects the key segments that reflect emotional features from the whole audio signal. The selected sequence is transformed into a spectrogram using the short-time Fourier transform (STFT). A deep residual network (ResNet) and a deep Bi-LSTM network then learn the emotion-related hidden features of the spectrogram in both the spatial and temporal dimensions. The final emotion classification is produced by a Softmax classifier. Experimental results show that, compared with other recognition methods, the proposed method improves the emotion recognition rate and reduces the model's processing time.
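The first two stages described in the abstract (key-segment selection by clustering, then STFT) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: it uses plain k-means because the abstract does not say what the improvement consists of, and the segment length, the log-spectrum features, and all function names (`kmeans`, `select_key_segments`, `stft_magnitude`) are assumptions made for the example. The ResNet, Bi-LSTM, and Softmax stages are omitted.

```python
import numpy as np


def kmeans(features, k, n_iter=20, seed=0):
    """Plain k-means (assumption: stands in for the paper's improved variant)."""
    rng = np.random.default_rng(seed)
    # Start from k distinct feature vectors chosen at random.
    centers = features[rng.choice(len(features), size=k, replace=False)]
    for _ in range(n_iter):
        dists = np.linalg.norm(features[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = features[labels == j]
            if len(members):  # leave an empty cluster's center unchanged
                centers[j] = members.mean(axis=0)
    return centers


def select_key_segments(signal, seg_len, k):
    """Keep the segment closest to each cluster center, in temporal order."""
    n_seg = len(signal) // seg_len
    segments = signal[: n_seg * seg_len].reshape(n_seg, seg_len)
    # Hypothetical per-segment feature: log-magnitude spectrum.
    feats = np.log1p(np.abs(np.fft.rfft(segments, axis=1)))
    centers = kmeans(feats, k)
    dists = np.linalg.norm(feats[:, None, :] - centers[None, :, :], axis=2)
    # np.unique sorts the indices and drops duplicates, so fewer than k
    # segments may be returned if two centers share a nearest segment.
    key_idx = np.unique(dists.argmin(axis=0))
    return segments[key_idx], key_idx


def stft_magnitude(signal, n_fft=256, hop=128):
    """Magnitude STFT with a Hann window, shape (freq_bins, time_frames)."""
    n_frames = 1 + (len(signal) - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hanning(n_fft)
    return np.abs(np.fft.rfft(frames, axis=1)).T


# Synthetic two-tone signal standing in for a speech recording.
sr = 16000
t = np.arange(2 * sr) / sr
audio = np.where(t < 1.0, np.sin(2 * np.pi * 220 * t), np.sin(2 * np.pi * 880 * t))

key_segments, key_idx = select_key_segments(audio, seg_len=4000, k=2)
spec = stft_magnitude(key_segments.reshape(-1))
```

In a full pipeline along the lines the abstract describes, `spec` would be the input to the ResNet and Bi-LSTM feature extractors. Selecting only a few representative segments shortens the sequence those networks must process, which is one plausible source of the reduced processing time the abstract reports.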
