Xuedong Huang (黄学东), Spoken Language Processing - A Guide to Theory, Algorithm and System Development.pdf (speech recognition, speaker recognition, speech synthesis; a classic)
TABLE OF CONTENTS
1. INTRODUCTION................................................................................................... 1
1.1. MOTIVATIONS .......................................................................................................2
1.1.1. Spoken Language Interface ...................................................................2
1.1.2. Speech-to-speech Translation................................................................3
1.1.3. Knowledge Partners...............................................................................3
1.2. SPOKEN LANGUAGE SYSTEM ARCHITECTURE ........................................................4
1.2.1. Automatic Speech Recognition ..............................................................4
1.2.2. Text-to-Speech Conversion....................................................................6
1.2.3. Spoken Language Understanding..........................................................7
1.3. BOOK ORGANIZATION ............................................................................................9
1.3.1. Part I: Fundamental Theory..................................................................9
1.3.2. Part II: Speech Processing ....................................................................9
1.3.3. Part III: Speech Recognition ...............................................................10
1.3.4. Part IV: Text-to-Speech Systems..........................................................10
1.3.5. Part V: Spoken Language Systems ......................................................10
1.4. TARGET AUDIENCES.............................................................................................11
1.5. HISTORICAL PERSPECTIVE AND FURTHER READING.............................................11
PART I: FUNDAMENTAL THEORY
2. SPOKEN LANGUAGE STRUCTURE.........................................................19
2.1. SOUND AND HUMAN SPEECH SYSTEMS................................................................21
2.1.1. Sound ...................................................................................................21
2.1.2. Speech Production...............................................................................24
2.1.3. Speech Perception................................................................................28
2.2. PHONETICS AND PHONOLOGY...............................................................................36
2.2.1. Phonemes.............................................................................................36
2.2.2. The Allophone: Sound and Context.....................................................47
2.2.3. Speech Rate and Coarticulation ..........................................................49
2.3. SYLLABLES AND WORDS......................................................................................50
2.3.1. Syllables...............................................................................................51
2.3.2. Words...................................................................................................52
2.4. SYNTAX AND SEMANTICS.....................................................................................57
2.4.1. Syntactic Constituents..........................................................................58
2.4.2. Semantic Roles.....................................................................................63
2.4.3. Lexical Semantics ................................................................................64
2.4.4. Logical Form .......................................................................................66
2.5. HISTORICAL PERSPECTIVE AND FURTHER READING.............................................68
3. PROBABILITY, STATISTICS AND INFORMATION THEORY..73
3.1. PROBABILITY THEORY .........................................................................................74
3.1.1. Conditional Probability And Bayes' Rule............................................75
3.1.2. Random Variables................................................................................77
3.1.3. Mean and Variance..............................................................................79
3.1.4. Covariance and Correlation................................................................83
3.1.5. Random Vectors and Multivariate Distributions.................................84
3.1.6. Some Useful Distributions ...................................................................85
3.1.7. Gaussian Distributions ........................................................................92
3.2. ESTIMATION THEORY ...........................................................................................98
3.2.1. Minimum/Least Mean Squared Error Estimation................................99
3.2.2. Maximum Likelihood Estimation .......................................................104
3.2.3. Bayesian Estimation and MAP Estimation ........................................108
3.3. SIGNIFICANCE TESTING.......................................................................................114
3.3.1. Level of Significance..........................................................................114
3.3.2. Normal Test (Z-Test)..........................................................................116
3.3.3. χ² Goodness-of-Fit Test...................................................................117
3.3.4. Matched-Pairs Test............................................................................119
3.4. INFORMATION THEORY ......................................................................................121
3.4.1. Entropy ..............................................................................................121
3.4.2. Conditional Entropy ..........................................................................124
3.4.3. The Source Coding Theorem .............................................................125
3.4.4. Mutual Information and Channel Coding..........................................127
3.5. HISTORICAL PERSPECTIVE AND FURTHER READING...........................................129
4. PATTERN RECOGNITION...........................................................................133
4.1. BAYES DECISION THEORY..................................................................................134
4.1.1. Minimum-Error-Rate Decision Rules ................................................135
4.1.2. Discriminant Functions .....................................................................138
4.2. HOW TO CONSTRUCT CLASSIFIERS.....................................................................140
4.2.1. Gaussian Classifiers ..........................................................................142
4.2.2. The Curse of Dimensionality..............................................................144
4.2.3. Estimating the Error Rate..................................................................146
4.2.4. Comparing Classifiers .......................................................................148
4.3. DISCRIMINATIVE TRAINING ................................................................................150
4.3.1. Maximum Mutual Information Estimation.........................................150
4.3.2. Minimum-Error-Rate Estimation.......................................................156
4.3.3. Neural Networks ................................................................................158
4.4. UNSUPERVISED ESTIMATION METHODS .............................................................163
4.4.1. Vector Quantization...........................................................................164
4.4.2. The EM Algorithm .............................................................................170
4.4.3. Multivariate Gaussian Mixture Density Estimation...........................172
4.5. CLASSIFICATION AND REGRESSION TREES..........................................................176
4.5.1. Choice of Question Set.......................................................................177
4.5.2. Splitting Criteria................................................................................179
4.5.3. Growing the Tree...............................................................................181
4.5.4. Missing Values and Conflict Resolution ............................................182
4.5.5. Complex Questions ............................................................................183
4.5.6. The Right-Sized Tree..........................................................................185
4.6. HISTORICAL PERSPECTIVE AND FURTHER READING...........................................190
PART II: SPEECH PROCESSING
5. DIGITAL SIGNAL PROCESSING..............................................................201
5.1. DIGITAL SIGNALS AND SYSTEMS ........................................................................202
5.1.1. Sinusoidal Signals..............................................................................203
5.1.2. Other Digital Signals.........................................................................206
5.1.3. Digital Systems ..................................................................................206
5.2. CONTINUOUS-FREQUENCY TRANSFORMS...........................................................209
5.2.1. The Fourier Transform ......................................................................209
5.2.2. Z-Transform.......................................................................................211
5.2.3. Z-Transforms of Elementary Functions.............................................212
5.2.4. Properties of the Z and Fourier Transform .......................................215
5.3. DISCRETE-FREQUENCY TRANSFORMS................................................................216
5.3.1. The Discrete Fourier Transform (DFT).............................................218
5.3.2. Fourier Transforms of Periodic Signals............................................219
5.3.3. The Fast Fourier Transform (FFT)....................................................222
5.3.4. Circular Convolution.........................................................................227
5.3.5. The Discrete Cosine Transform (DCT)..............................................228
5.4. DIGITAL FILTERS AND WINDOWS........................................................................229
5.4.1. The Ideal Low-Pass Filter .................................................................229
5.4.2. Window Functions .............................................................................230
5.4.3. FIR Filters..........................................................................................232
5.4.4. IIR Filters...........................................................................................238
5.5. DIGITAL PROCESSING OF ANALOG SIGNALS........................................................242
5.5.1. Fourier Transform of Analog Signals................................................242
5.5.2. The Sampling Theorem......................................................................243
5.5.3. Analog-to-Digital Conversion ...........................................................245
5.5.4. Digital-to-Analog Conversion ...........................................................246
5.6. MULTIRATE SIGNAL PROCESSING.......................................................................247
5.6.1. Decimation.........................................................................................248
5.6.2. Interpolation ......................................................................................249
5.6.3. Resampling ........................................................................................250
5.7. FILTERBANKS .....................................................................................................250
5.7.1. Two-Band Conjugate Quadrature Filters..........................................250