基于正弦模型的语音处理分析与合成技术

需积分: 9 37 浏览量更新于2024-07-16 收藏 7.97MB PDF 举报

《基于正弦模型的语音处理》是一篇发表于1988年的论文，由R.J. McAulay和T.F. Quatieri共同撰写。该研究提出了一种利用正弦模型进行语音分析与合成的技术。在语音信号的处理中，这种方法将语音分解为一系列正弦波，每个正弦波由其幅度、频率和相位参数来描述。这些参数的估计是通过短时傅立叶变换（STFT）对输入语音进行简单峰值检测算法实现的。论文的核心思想是，通过高频分辨率的谱成分分析，使用频率匹配算法跟踪快速变化的特征。当一个特定频率的轨迹被追踪到时，会应用一个三次相位函数到一个正弦波发生器上，这个发生器的输出会被幅度调制，并与其他频率轨道的正弦波合成。这种合成后的语音输出保持了原始语音的基本波形形状，且几乎难以与原信号区分。这种技术在一定程度上保留了语音的感知特性，即使在噪声环境中也能保持清晰度。令人欣喜的是，研究发现这一方法在广泛的输入条件下都能实现高质量的语音再现，包括各种语速、音调和口音。这种基于正弦模型的处理方式在当时的数字信号处理领域具有重要意义，因为它提供了一种有效且灵活的方法来理解和生成语音信号，对于语音识别、语音合成和降噪等应用场景具有实用价值。它标志着在语音处理技术中对复杂信号的深入理解以及对简洁数学模型的有效应用的里程碑。

McAulay

aI. - Speech Processing

Based

Sinusoidal

Model

where

from

which

follows

that

the

optimal

estimate

for

the

amplitude

and

phase

(16)

(17)

l = i

(13)

l ~ i

+(N+

[IYlw~)-'Y~12-IYlW~)12I.

1=1

sine

(w~

w~)

sine

[(l-

w~l

={6

.y~

Y(lw;)

•

which

reduces

the

error

where

Then

the

error

expression

re-

duces

Iy(n)

2(N

+ 1) Re I

(')'k)

y(w

)

I 1

(14)

(N+

1'Y~12

1=1

Ylw) =

y(n)

exp

(-jnw)

(15)

the

STFT of

the

measurement

signal. By

completing

the

square

Eq. 14.

the

error

can

written

From

this

calculation

follows.

therefore,

that

the

error

minimized

selecting

all

the

har-

monic

frequencies

the

speech

bandwidth,

(ie, L

fl/w~).

Equations

and

completely

specify

the

structure

the

ideal

estimator

and

show

that

the

optimal

estimator

depends

the

speech

data

through

the

STFT (Eq. 15).

Although

these

results

are

eqUivalent

Fourier-series

repre-

sentation

a periodic waveform,

the

results

lead

intuitive

generalization

the

practical

case.

This

done

considering

the

functionlY

(w)

continuous

function

For

the

idealized

voiced-speech

case,

this

function

(called a periodogram) will

pulse-

s(n) =

'Y~

exp

Unlw~)

(12)

1=1

represents

the

complex

amplitude

for

the

component

the

sine

waves.

Since

the

measurements

are

made

digitized

speech,

sampled-data

notation

[s(n)]

used.

this

respect

the

time

index

corresponds

the

uniform

samples

t- t

;

therefore n

ranges

from

-N/2

N/2.

with

n = 0

reset

the

center

the

analysis

window

for every

frame

and

where

N+1

the

duration

the

analysis

window.

The

problem

now

fit

the

synthetic

speech

wave-

form

Eq. 9

the

measured

waveform.

de-

noted

y(n).

useful

criterion

for

judging

the

quality

fit

the

mean-squared

error

Iy(n) _ s(n)

(10)

Iy(n)

_ 2

y(n)

s*(n)

Is(n)

n n n

Substituting

the

speech

model

Eq. 9

into

Eq.

leads

the

error

expression

Iy(n) 1

2 Re

('Y~)*

y(n) exp

(-jnw~)

1=1

(11)

+ 1)

~ ~

('Y

sine

)

I 1 I 1

1=1 1=1

where

sinc(x)

sin

[(N+

l)x/21/[(N

sin

(x/2)].

The

task

the

estimator

identifya

set

sine

waves

that

minimizes

Eq. 11.

Insights

into

the

development

suitable

estimator

can

obtained

restricting

the

class

input

signals

the

idealization

perfectly voiced

speech,

ie,

speech

that

periodic,

hence

having

compo-

nent

sine

waves

that

are

harmonically

related.

this

case

the

synthetic

speech

waveform

can

written

where

21T/T~

and

where

the

pitch

period

assumed

constant

over

the

duration

the

frame.

For

the

purpose

establishing

the

structure

the

ideal

estimator.

further

assumed

that

the

pitch

period

known

and

that

the

width

the

analysis

window

multiple

•

Under

these

highly

idealized

conditions.

the

sinc

(.)

function

the

last

term

Eq. 11

reduces

156

The

Lincoln

Laboratory

Journal.

Volume

Number

(l988)

剩余15页未读，继续阅读

EthanLifeGreat

粉丝: 151
资源: 1

基于正弦模型的语音处理分析与合成技术

A new 1.2 kbit/s speech coding method based on a sinusoidal harmonic vocoder

IEC 61300-2-1-2023 Part 2-1 Tests – Vibration (sinusoidal).rar

Blind Detection Algorithm Based on Composite Sinusoidal Chaotic Neural Network

大二电路理论：chapter12_non-sinusoidal circuits2012.pdf

1.55-\mu m coherent lidar based on SPA sinusoidal frequency demodulation techniques

大二电路理论：chapter8_Sinusoidal Steady-state2013.pdf

A generalized sinusoidal model and its applications (2009年)

利用基波分析法对LLC谐振电路进行分析.pdf.pdf

Control of Vortex Structures on a Rectangular Slab via a Sinusoidal Surface

变频器说明书系列-APP-A.pdf

最新资源