提升助听效果：一种新型耳蜗植入语音编码算法

105 浏览量更新于2024-08-27 收藏 251KB PDF 举报

"这篇研究论文提出了一种新颖的语音编码算法，用于改善耳蜗植入物的性能，以增强在噪声环境中的语音识别以及音调语言和音乐感知。该算法称为希尔伯特-黄变换刺激（HHTS）方法，利用希尔伯特-黄变换（HHT）对非线性和非平稳信号进行分析，从而提取瞬时频率信息。" 在当前的听力恢复技术中，耳蜗植入物（CI）是为重度至极重度感音神经性听力损失患者恢复部分听力的重要手段。尽管耳蜗植入物已经取得了显著的进步，但用户在嘈杂环境中识别语音、理解和感知音调语言（如汉语等）以及音乐方面仍然面临挑战。为了克服这些难题，本研究论文提出了HHTS算法。希尔伯特-黄变换（HHT）是一种强大的信号处理工具，由经验模态分解（EMD）的筛选过程和希尔伯特变换（HT）组成。EMD是一种自适应的数据分析方法，能将复杂的信号分解成一系列简单的内在模态函数（IMF）。这些IMF代表了信号的不同频率成分，而希尔伯特变换则用于获取每个IMF的瞬时频率和振幅信息。这种瞬时特性对于理解和解析非线性、非平稳信号至关重要，如人类语音和音乐中的动态变化。在HHTS算法中，首先通过EMD对输入的语音信号进行分解，然后应用希尔伯特变换来获取每个IMF的瞬时频率。这种方法允许更精确地捕捉语音信号的动态特征，尤其是在噪声环境下。通过强调关键的频率成分，HHTS能够提高耳蜗植入者在噪声背景下的语音识别能力，同时可能改善他们对音调语言的感知，因为音调语言的语义很大程度上依赖于声音的频率变化。此外，HHTS还可能有助于音乐感知的提升，因为音乐中的旋律和节奏都与频率的动态变化密切相关。通过提供更清晰的频率信息，耳蜗植入者可能能够更好地解析音乐的结构和情感表达。这篇研究论文的贡献在于提出了一种创新的语音编码策略，即HHTS，它利用希尔伯特-黄变换的强大力量来增强耳蜗植入用户的听觉体验。这种方法有可能显著改善他们在复杂听觉场景中的性能，从而提高生活质量。然而，实际效果需要通过临床试验进一步验证，并可能需要对现有的耳蜗植入系统进行硬件和软件的优化来实现这一技术的应用。

2012 5th International Congress on Image and Signal Processing (CISP 2012)

A Novel Speech Coding Algorithm for Cochlear

Implants

Hongyun LIU Weidong WANG

Kaiyuan LI Zhengbo ZHANG

Department of Medical Engineering & Supply Center,

Chinese PLA General Hospital,

Beijing, China

Abstract—Cochlear implants (CI) can restore some degree of

hearing to individuals with severe to profound sensorineural

hearing loss. In recent years, new speech coding algorithms were

developed for improving the performance of cochlear implants,

but sound recognition in noisy environment, tonal language and

music perception remain very difficult for most cochlear implant

users. To enhance speech recognition in noise, as well as tonal

language and music perception, a new speech coding algorithm

called Hilbert Huang Transform Stimulating(HHTS) for cochlear

implants was presented. HHT is a powerful tool which consists of

sifting procedure of empirical mode decomposition (EMD) and

the Hilbert Transform (HT) to analyze non-linear and non-

stationary signal. Instantaneous frequency could be derived from

time-frequency description of speech signal in the sifting

procedure and a lot of information comprised in fine structure is

not only reflection of speech contents, speech rhythms and tones,

but also speakers’ individual characteristics, so that have to get

finer envelope and fine structure properties of speech. HHTS,

continuous interleaved sampling (CIS), channel specific sampling

sequences (CSSS), frequency amplitude modulation encoding

(FAME) strategies were simulated based on MATLAB.

Synthesized stimulus and their spectrum were correlation

analyzed between original signals. Compared to other 3

strategies, HHTS obtain the highest correlation coefficient

between spectrum of synthesized signal and that of original

speech. The spectrum of synthesized signal through HHTS

strategy is the most correlated to that of original speech, and the

correlation is significant.

Keywords-Cochlear implant; Hilbert Huang Transform;

Empirical mode decomposition; Hilbert Transform;

I. INTRODUCTION

Cochlear implants are accepted as the unique medical

device which can restore partial hearing to individuals with

severe to profound sensorineural hearing loss through electric

stimulation of residual auditory nerve. As of December 2010,

approximately 219,000 people worldwide have received

cochlear implants; in the U.S., roughly 42,600 adults and

28,400 children are recipients. Cochlear implants have been

remarkably successful in providing hearing to the profoundly

deaf, and the modern multichannel cochlear implants produce

word recognition scores around 80% for sentences in quiet,

allowing the majority of their users to talk on the phone

fluently [1][2]. However, the speech perception attainable by

cochlear implant users has reached a plateau with current

cochlear implant speech coding strategies. Apart from that,

serious limitations are observed in the representation of speech

in noise, tonal languages and music.

Traditional cochlear implants have generally employed two

types of speech coding algorithms. In one type, only amplitude

or envelope characteristics of original speech are extracted and

modulated a fixed rate biphasic pulse train, such as CIS

strategy [3]. In the other type, band-pass filtered raw analog

processed speech, which contains amplitude, frequency, phase

and fine structure information, are delivered directly to

electrodes to stimulate residual auditory nerve, compressed

analog (CA) is precisely this kind of strategy [4][5][6]. The two

types of strategies mentioned above each have their own

disadvantage. One of them provides too little (amplitude or

envelope modulation only) and the other provides too much

indiscriminable information [2]. During the past few years,

many speech coding algorithms, such as channel specific

sampling (CSSS)[5], wavelet zero-crossing stimulation (WZCS)

[7][8], FAME, asynchronous interleaved sampling (AIS)[9]

algorithms and so on, were presented and researched. Besides

amplitude or envelope information, frequency, phase, fine

structure and other essential components are extracted and

encoded to improve the quality of sound perception for

cochlear implant users in variety of circumstances.

Motivated by literature review and physiological evidence,

a novel speech coding algorithm was proposed to encode phase

or fine structure of original sound in cochlear implants to

improve their perception of noisy speech, tonal languages and

music. In the following sections, we will first present an

algorithm that decomposes a signal into intrinsic mode

functions to obtain slowly varying amplitude and phase or fine

structure characteristics of original speech. We term this

algorithm the Hilbert Huang Transform Stimulating (HHTS)

strategy. Computer simulation was conducted to verify the

algorithm’s accuracy and efficiency from a signal processing

point of view.

II. H

ILBERT HUANG TRANSFORM STIMULATING

ALGORITHM

A. Hilbert Transform

HT is one of the important mathematic tools in the field of

signal analysis and processing. Supposing there is a real

下载后可阅读完整内容，剩余3页未读，立即下载

weixin_38747126

粉丝: 5
资源: 921

提升助听效果：一种新型耳蜗植入语音编码算法

《SPEECH CODING ALGORITHMS》

Ultra Low Bit-Rate Speech Coding

Channel Coding Methods for Non-Volatile Memories

truncated Huffman tree

语音编码相关的参考书籍

Try to write an algorithm to Calculate the WPL of a Huffman Tree.

huffman verilog

g.729version1.5

information theory: coding theorems for discrete memoryless systems

Given a distribution P on letters, find the lowest-cost tree.

最新资源