语音情感识别：特征选择与极限学习机的结合

156 浏览量更新于2024-07-15 1 收藏 252KB PDF 举报

"语音情感识别的研究中，特征选择和机器学习模型的应用对于提高识别效果至关重要。本文提出了一种结合相关分析和Fisher准则的特征选择方法，以及基于极限学习机的决策树情感识别模型。实验在CASIA中文语音数据库上进行，证明了该方法在不同情感状态的识别上具有高效性和准确性。" 语音情感识别是一种技术，旨在通过分析人类的语音来确定说话者的情绪状态。特征选择在这个领域起着核心作用，因为它可以帮助减少不必要数据的复杂性，降低计算成本，同时提高模型的性能。文章指出，过去的研究往往忽视了同一音频源产生的特征之间的相互作用，这可能导致冗余特征的产生。为解决这个问题，作者提出了一种新的特征选择策略，即基于相关分析和Fisher准则的方法。这种方法能够识别和去除高度相关的冗余特征，从而保留更具代表性的特征集。相关分析是统计学中用于测量变量间关系强度的一种工具，它可以发现特征之间的关联性，帮助消除重复信息。Fisher准则则是一种用于特征选择的经典方法，它基于特征的区分能力，选取能最大程度区分不同类别的特征。结合这两种方法，可以有效地筛选出对情感识别至关重要的特征。极限学习机(Extreme Learning Machine, ELM)是一种单隐藏层前馈神经网络的学习算法，以其快速训练速度和良好性能而受到关注。在本文中，ELM被应用于构建决策树模型进行情感识别。由于ELM的高效学习特性，它能够处理大量特征，并且在处理非线性问题时表现优秀。在对CASIA中文语音数据库进行的情感识别实验中，结合特征选择的ELM决策树模型显示出显著的识别率提升，平均达到89.6%，这表明该方法能够有效地区分不同说话者的情感状态。此外，这种方法的潜在应用在于实现独立于说话者的计算机或机器人交互。未来的智能系统，如虚拟助手或服务机器人，可以利用这种技术更准确地理解和响应用户的情感，从而提高人机交互的质量和用户体验。总结来说，"基于特征选择和极限学习机决策树的语音情感识别"这一研究通过结合相关分析和Fisher准则优化特征选择，利用ELM决策树提高情感识别的准确性。这种方法不仅有助于降低计算复杂性，还能提高模型的泛化能力，为语音情感识别领域提供了一种新的有效工具。

recognition rate of speech emotion recognition and the universality of speech

emotion recognition system.

3.1. Speaker-dependent features

It is generally believed that the emotional features such as rhythm char-

acteristics are easily inﬂuenced by the speaker’s personal characteristics and

data distribution of feature set will change greatly for diﬀerent speakers with

similar emotional states. And these features are summarized as speaker-

dependent features of the speech, which usually contain rich personal emo-

tional information [3], as shown in Table 1, including fundamental frequency,

short-time energy, MFCC coeﬃcient, spectral energy dynamic coeﬃcients,

etc. These emotional features contain vast quantities of emotional informa-

tion, and they reﬂect personal speech characteristics of the speaker. Even in

the same emotional state, the speed and the volume of diﬀerent speaker may

have great diﬀerence.

Table 1: Speaker-dependent features.

Speaker-dependent

Speciﬁc characteristics

features

Prosodic features

Fundamental Fundamental frequency;

frequency Fundamental frequency maximum;

characteristic Four bit value of fundamental frequency

Energy related

Short-time maximum amplitude;

features

Short-time average energy;

Short-time average amplitude

Time length

Short-time average cross zero ratio;

correlation

Speed of speech

characteristic

Quality Breath sound; Brightness; Throat sound; The maximum value and

characteristics the average value of the ﬁrst, two and three formant frequencies

Spectrum 12 order MFCC coeﬃcients; Spectral energy dynamic coeﬃcients

characteristics of 12 equally spaced frequency bands

3.2. Speaker-independent features

In order to suppress the individual characteristics of diﬀerent people and

eliminate the diﬀerences in the numerical values of individualized speech

emotion characteristics, the change rate is introduced to reﬂect the changes in

剩余27页未读，继续阅读

weixin_38738528

粉丝: 5
资源: 873

语音情感识别：特征选择与极限学习机的结合

基于特征降维的语音情感识别附matlab代码.zip

【情感识别】基于特征降维的语音情感识别附matlab代码 上传.zip

基于机器学习的语音情感识别

结合极限学习机和决策树进行基于HMM的语音合成时长预测

ELM.zip_ELM分类_极限学习_极限学习机_极限学习机 ELM

极限学习机拟合和分类的MATLAB代码

BA_ELM_ba-elm_极限学习机_ELM

基于matlab语音处理-代码说明.txt

机器学习中神经网络.pdf

《MATLAB 神经网络43个案例分析》包含代码和测试数据

最新资源

【情感识别】基于特征降维的语音情感识别附matlab代码上传.zip