Short-Time Spectral Features for Chinese Accent Detection Based on MFCC and RASTA-PLP


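This paper, "Chinese Accent Detection Method Research Based on Short-Time Spectrum Features," was published in 2014 in the Journal of Frontiers of Computer Science and Technology (《计算机科学与探索》). The authors, ZHAO Yunxue, ZHANG Long, and ZHENG Shijie, study how short-time spectral characteristics can be used to detect accent (stress) in Chinese speech. Accent matters in spoken communication because it affects both how meaning is understood and how emotion is expressed.

The work relies on two common short-time spectral analysis methods: Mel frequency cepstrum coefficients (MFCC) and relative spectra perceptual linear prediction (RASTA-PLP). Both are used to extract spectral features from the speech signal, in order to capture the patterns that appear when a speaker emphasizes certain words or syllables. From these features the researchers build two independent models, one on the MFCC feature set and one on the RASTA-PLP feature set. A naive Bayes classifier is chosen as the modeling tool because it is simple, efficient, and known to perform well on high-dimensional data, which makes it suitable for text and speech features.

Both models are trained and tested, and their performance on the Chinese accent detection task is evaluated with metrics such as precision, recall, and F1 score. The article describes the experimental design in detail, including the choice of data set, the feature engineering process, and the evaluation method. It also discusses factors that may affect the results, such as the speaker's regional accent, speaking rate, and possible noise interference. By analyzing the experimental results, the authors show which short-time spectral feature set performs better for Chinese accent detection and provide a reference for follow-up research.

Beyond introducing short-time spectral features for Chinese accent detection, the paper shows a concrete case of improving a speech processing task with statistical learning. It is a useful technical paper for engineers and technical staff working in speech recognition and natural language processing. Readers can learn how an auditory model is combined with a practical application scenario to improve the accuracy and practicality of speech signal processing.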
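To make the MFCC step concrete, the following is a minimal NumPy sketch of a textbook MFCC pipeline (framing, Hamming window, power spectrum, mel filterbank, log, DCT-II). It is not the authors' implementation. The sampling rate, frame sizes, filter count, and the hypothetical `segment_features` helper, which pools frame-level coefficients into one vector per segment by mean and standard deviation, are all assumptions made for illustration; the source does not state these settings.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sr):
    """Triangular filters spaced evenly on the mel scale."""
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):          # rising slope
            fb[i - 1, k] = (k - left) / max(center - left, 1)
        for k in range(center, right):         # falling slope
            fb[i - 1, k] = (right - k) / max(right - center, 1)
    return fb

def mfcc(signal, sr=16000, frame_len=0.025, frame_step=0.010,
         n_fft=512, n_filters=26, n_ceps=13):
    """Return an (n_frames, n_ceps) array of MFCCs. All defaults are assumed."""
    flen = int(round(frame_len * sr))
    fstep = int(round(frame_step * sr))
    n_frames = 1 + max(0, (len(signal) - flen) // fstep)
    # Index matrix that slices the signal into overlapping frames.
    idx = np.arange(flen)[None, :] + fstep * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(flen)
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2 / n_fft
    log_energy = np.log(power @ mel_filterbank(n_filters, n_fft, sr).T + 1e-10)
    # DCT-II of the log filterbank energies, keeping the first n_ceps terms.
    n = np.arange(n_filters)
    basis = np.cos(np.pi * np.arange(n_ceps)[:, None] * (2 * n + 1) / (2 * n_filters))
    return log_energy @ basis.T

def segment_features(signal, sr=16000):
    """Hypothetical pooling: per-coefficient mean and std over the segment."""
    c = mfcc(signal, sr)
    return np.concatenate([c.mean(axis=0), c.std(axis=0)])

# Synthetic stand-in for one speech segment (1 s of noise), not real data.
rng = np.random.default_rng(0)
segment = rng.standard_normal(16000)
print(mfcc(segment).shape, segment_features(segment).shape)
```

Pooling to a fixed-length vector is one simple way to feed variable-length segments to a classifier; RASTA-PLP would replace the `mfcc` function with a different front end and is not sketched here.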

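ZHAO Yunxue, ZHANG Long, ZHENG Shijie. Chinese accent detection method research based on short-time spectrum features [J]. Journal of Frontiers of Computer Science and Technology, 2014, 8(9): 1120-1128.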

ISSN 1673-9418 CODEN JKYTA8

Journal of Frontiers of Computer Science and Technology

1673-9418/2014/08(09)-1120-09

doi: 10.3778/j.issn.1673-9418.1407004

E-mail: fcst@vip.163.com

http://www.ceaj.org

Tel: +86-10-89056056

Chinese Accent Detection Method Research Based on Short-Time Spectrum Features

(Original Chinese title: 短时谱特征的汉语重音检测方法研究)

ZHAO Yunxue (赵云雪), ZHANG Long (张珑) 1,2+, ZHENG Shijie (郑世杰)

1. College of Computer Science and Information Engineering, Harbin Normal University, Harbin 150025, China

2. School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China

+ Corresponding author: E-mail: zlwalkman@sina.com

ZHAO Yunxue, ZHANG Long, ZHENG Shijie. Chinese accent detection method research based on short-time spectrum features. Journal of Frontiers of Computer Science and Technology, 2014, 8(9): 1120-1128.

Abstract: Accent plays a critically important role in spoken communication. In order to verify the effect of short-time spectrum feature sets based on an auditory model in Chinese accent detection, this paper uses the MFCC (Mel frequency cepstrum coefficient) algorithm and the RASTA-PLP (relative spectra perceptual linear prediction) algorithm to extract the short-time spectrum information of each voice segment, and builds short-time spectrum feature sets based on the MFCC algorithm and the RASTA-PLP algorithm. Then, it chooses the NaiveBayes classifier to model the two feature sets, and takes the class with the maximum a posteriori probability as the object's class. This classification method makes full use of the related phonetic features of the speech segment. The short-time spectrum feature set based on MFCC and the short-time spectrum feature set based on RASTA-PLP achieve 82.1% and 80.8% accent detection accuracy, respectively, on ASCCD (annotated speech corpus of Chinese discourse). The experimental results indicate that short-time spectrum features based on MFCC and short-time spectrum features based on RASTA-PLP can be used for Chinese accent detection research.
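The maximum a posteriori decision described in the abstract can be illustrated with a small naive Bayes sketch: for each class, add the log prior to the sum of per-feature log likelihoods, then pick the class with the largest total. The Gaussian likelihood, the hypothetical `GaussianNaiveBayes` class, and the synthetic 26-dimensional data below are assumptions for illustration only; the source does not say which likelihood model was used, and the numbers produced here have no relation to the paper's results.

```python
import numpy as np

class GaussianNaiveBayes:
    """Naive Bayes with per-feature Gaussian likelihoods (an assumed choice)."""

    def fit(self, X, y):
        self.classes_ = np.unique(y)
        # Class priors estimated from label frequencies.
        self.log_prior_ = np.log(np.array([np.mean(y == c) for c in self.classes_]))
        self.mean_ = np.array([X[y == c].mean(axis=0) for c in self.classes_])
        # Small constant keeps variances strictly positive.
        self.var_ = np.array([X[y == c].var(axis=0) for c in self.classes_]) + 1e-6
        return self

    def log_posterior(self, X):
        """Unnormalized log posterior, shape (n_samples, n_classes)."""
        diff = X[:, None, :] - self.mean_[None, :, :]
        log_lik = -0.5 * np.sum(
            np.log(2.0 * np.pi * self.var_)[None, :, :] + diff ** 2 / self.var_[None, :, :],
            axis=2,
        )
        return self.log_prior_[None, :] + log_lik

    def predict(self, X):
        # MAP rule: choose the class with the largest posterior.
        return self.classes_[np.argmax(self.log_posterior(X), axis=1)]

# Synthetic two-class data standing in for "unaccented" (0) and "accented" (1)
# segment feature vectors. This is NOT data from ASCCD.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, size=(200, 26)),
               rng.normal(1.5, 1.0, size=(200, 26))])
y = np.array([0] * 200 + [1] * 200)

perm = rng.permutation(len(y))
train, test = perm[:300], perm[300:]

model = GaussianNaiveBayes().fit(X[train], y[train])
acc = np.mean(model.predict(X[test]) == y[test])
print(f"held-out accuracy on synthetic data: {acc:.3f}")
```

Working in log space avoids numerical underflow when many per-feature likelihoods are multiplied, which is why the posterior is accumulated as a sum of logs.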

* The Natural Science Foundation of Heilongjiang Province of China under Grant No. F201321 (黑龙江省自然科学基金); the Philosophy and Social Sciences Foreign Language Joint Research Foundation of Heilongjiang Province of China under Grant No. 12H007 (黑龙江省哲学社会科学外语联合研究项目).

Received 2014-06, Accepted 2014-08.

CNKI online-first publication: 2014-08-19, http://www.cnki.net/kcms/doi/10.3778/j.issn.1673-9418.1407004.html
