Application of the Distributed Kalman Filter to Speaker Tracking in Microphone Array Networks

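"This paper discusses a speaker tracking method that uses a distributed Kalman filter in a microphone array network and is aimed at noisy and reverberant environments. The method first estimates the time delay of arrival (TDOA) at each microphone pair with the generalized cross-correlation (GCC) method. It then uses the Langevin model as the state equation to model the speaker's movement and obtains the measurement equations by linearizing the TDOA model. Finally, distributed Kalman filtering over the microphone array network estimates the position of the moving speaker and tracks the trajectory smoothly. The method is scalable and can adapt to different environmental conditions."

Main text: In modern audio processing, particularly in sound source localization and speech recognition, speaker tracking is a key technical challenge, and accurate tracking matters most in noisy and reverberant environments. The paper proposes a speaker tracking method based on the distributed Kalman filter for microphone array networks, with the goal of improving localization accuracy in complex acoustic conditions.

The distributed Kalman filter (DKF) is a statistical method for state estimation in distributed systems. It extends the conventional Kalman filter to multiple sensor nodes: each node estimates the system state and performs a local update, then shares information with other nodes over the communication network so that the nodes work toward a common estimate without a central processor. In the paper, the DKF is applied to a microphone array network to track the speaker's position over time.

First, the generalized cross-correlation (GCC) algorithm estimates the difference in arrival time of the sound at the two microphones of each pair, that is, the time delay of arrival (TDOA). GCC is a standard tool for estimating the time difference with which microphones at different positions receive the signal of the same source.

Next, the Langevin model is used as the state equation to describe the speaker's movement. The Langevin model is a commonly used stochastic motion model that describes position and velocity compactly and is well suited to natural human walking.

Linearizing the TDOA model yields measurement equations expressed in terms of the true TDOA. Together with the Langevin model, these equations describe how the speaker's position evolves and how it is observed.

Finally, the distributed Kalman filter runs iteratively over the microphone array network, updating the estimated state at each node and combining information from the nodes to obtain the speaker's position estimate. This improves localization accuracy, keeps the tracked trajectory smooth, and reduces the estimation errors caused by noise and environmental changes.

The method provides an effective speaker tracking strategy for microphone array networks, especially for multi-microphone setups in complex acoustic environments. Because it is scalable and adaptable, it can work with microphone array networks of different sizes and under various noise conditions, which makes it relevant to acoustic surveillance, voice interaction systems, and speech processing in smart spaces.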
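
The TDOA estimation step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `gcc_tdoa`, the optional PHAT weighting, and the `max_tau` search window are assumptions chosen for illustration, since this excerpt does not say which GCC weighting the paper uses.

```python
import numpy as np

def gcc_tdoa(x1, x2, fs, max_tau=None, phat=True):
    """Estimate the delay of x2 relative to x1, in seconds, by GCC.

    A positive result means the signal reaches microphone 2 later.
    The PHAT weighting is an assumption, not taken from the paper.
    """
    n = len(x1) + len(x2)                 # zero-pad to avoid circular wrap-around
    X1 = np.fft.rfft(x1, n=n)
    X2 = np.fft.rfft(x2, n=n)
    cross = X2 * np.conj(X1)              # cross-power spectrum
    if phat:
        cross /= np.abs(cross) + 1e-12    # keep phase only (PHAT weighting)
    cc = np.fft.irfft(cross, n=n)         # generalized cross-correlation

    max_shift = n // 2
    if max_tau is not None:
        max_shift = max(1, min(int(fs * max_tau), max_shift))
    # Reorder so the lags run from -max_shift to +max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    lag = int(np.argmax(np.abs(cc))) - max_shift
    return lag / fs
```

In practice `max_tau` would be set to the microphone spacing divided by the speed of sound, so that physically impossible delays are excluded from the peak search.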

Distributed Kalman filter-based speaker tracking in microphone array networks

Ye Tian, Zhe Chen, Fuliang Yin ⇑

School of Information and Communication Engineering, Dalian University of Technology, Dalian 116023, China

article info

Article history:

Received 8 February 2014

Received in revised form 15 August 2014

Accepted 3 September 2014

Available online 28 September 2014

Keywords:

Distributed Kalman filter

Microphone array network

Time delay of arrival

abstract

A speaker tracking method based on the distributed Kalman filter (DKF) is proposed for microphone array networks in noisy and reverberant environments. First, the time delay of arrival (TDOA) at each microphone pair is estimated by the generalized cross-correlation (GCC) method. Next, the Langevin model is used as the state equation to model the speaker's movement, while the measurement equations in terms of the true TDOA are derived by linearizing the TDOA model. Finally, the moving speaker's positions are estimated by distributed Kalman filtering in the microphone array network. The proposed method is scalable and yields a smooth trajectory of the speaker's movement with high tracking accuracy. Simulation results verify the effectiveness of the proposed method.
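
The state and measurement models named in the abstract can be sketched as follows. This is an illustration under stated assumptions, not the paper's derivation: it uses the discrete Langevin form common in the acoustic tracking literature, a 2-D state `[x, y, vx, vy]`, a first-order Taylor expansion of the TDOA, and hypothetical parameter values (`beta`, `v_bar`, `C_SOUND`). The paper's own parametrization and linearization are not visible in this excerpt.

```python
import numpy as np

C_SOUND = 343.0  # speed of sound in m/s (assumed room-temperature value)

def langevin_matrices(dt, beta=10.0, v_bar=1.0):
    """Return F, G for s[k] = F @ s[k-1] + G @ u[k], with u ~ N(0, I).

    Assumed model: v[k] = a v[k-1] + b u[k], p[k] = p[k-1] + dt v[k],
    where a = exp(-beta dt) and b = v_bar sqrt(1 - a^2).
    beta and v_bar are illustrative defaults, not values from the paper.
    """
    a = np.exp(-beta * dt)
    b = v_bar * np.sqrt(1.0 - a ** 2)
    F = np.array([[1.0, 0.0, a * dt, 0.0],
                  [0.0, 1.0, 0.0, a * dt],
                  [0.0, 0.0, a, 0.0],
                  [0.0, 0.0, 0.0, a]])
    G = np.array([[b * dt, 0.0],
                  [0.0, b * dt],
                  [b, 0.0],
                  [0.0, b]])
    return F, G

def tdoa(p, m1, m2):
    """Delay of microphone m2 relative to m1 for a source at position p."""
    return (np.linalg.norm(p - m2) - np.linalg.norm(p - m1)) / C_SOUND

def tdoa_jacobian(p, m1, m2):
    """Row of the linearized measurement matrix with respect to [x, y, vx, vy]."""
    d1, d2 = p - m1, p - m2
    grad = (d2 / np.linalg.norm(d2) - d1 / np.linalg.norm(d1)) / C_SOUND
    return np.concatenate((grad, np.zeros(2)))  # TDOA does not depend on velocity
```

With this choice of `b`, the stationary variance of each velocity component equals `v_bar ** 2`, which is why `v_bar` is usually read as the steady-state speed scale of the speaker.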

1. Introduction

Speaker localization and tracking with microphone arrays are useful in many applications, including audio/video conference systems [1], smart video monitoring systems [2], robots, human–machine interfaces, and distant speech capture and recognition.

Speaker localization [3–5] and speaker tracking [6–11] have been studied for many years. However, traditional methods usually require dedicated devices and prior knowledge of the positions and geometry of the microphone arrays.

In practice, the geometry of the microphone arrays may be irregular and their positions may be distributed randomly. Both can be obtained by self-calibration methods [12,13]. To determine a speaker's position with spatially irregular microphone arrays, distributed speaker localization methods [14,15] have been proposed recently. In [16], the global coherence field (GCF) was proposed; it is defined over the space of possible sound source locations and represents the plausibility that a sound source is active at a given point. In [17,18], the GCF was extended to the oriented GCF (OGCF), which allows both the position and the head orientation of a single active speaker to be estimated. In [19], multiple-speaker localization with the GCF based on acoustic map de-emphasis was proposed. In [14,20], the steered response power–phase transform (SRP–PHAT) method and a modification of it were proposed; they steer the microphone array to all potential source positions to search for the candidate source position. In [21,22], the localization performance of the SRP–PHAT method was significantly improved by selecting suitable microphone pairs in a microphone array network. In [15], Canclini et al. proposed a distributed speaker localization algorithm that minimizes a cost function, a fourth-order polynomial obtained by combining hyperbolic constraints from multiple sensors. However, these distributed speaker localization methods depend only on the signals in the current frame. They are not robust against strong room reverberation and can even fail under impulsive noise, such as a door slamming. Furthermore, in noisy and reverberant environments these methods may generate spurious sources, which are sometimes stronger than the true speech sources. To deal with these problems, speaker tracking methods estimate the speaker's position from not only the current measurement but also a series of past measurements. In this way, a smooth trajectory of the speaker's movement can be obtained robustly.

Distributed state estimation algorithms such as the distributed Kalman filter (DKF) [23,24] have received great attention recently. In the DKF, each node in a sensor network estimates the state of a linear dynamic system while sharing data only with its neighboring nodes at each time step. Unlike centralized state estimation algorithms, the DKF does not require a fusion center and is hence robust against the failure of such a center.
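
The idea of neighbor-only data sharing can be illustrated with a simplified sketch in information form. It is not the DKF of [23,24] or the paper's algorithm: the function name `dkf_step`, the single exchange of measurement information per time step, and the omission of any consensus term on the state estimates are all assumptions made to keep the example short.

```python
import numpy as np

def dkf_step(x_prior, P_prior, z, H, R, neighbors, F, Q):
    """One time step of a simplified distributed Kalman filter.

    x_prior[i], P_prior[i]: prior mean and covariance held by node i
    z[i], H[i], R[i]:       node i's measurement, matrix and noise covariance
    neighbors[i]:           indices of the nodes that node i can talk to
    Returns the predicted means and covariances for the next time step.
    """
    n_nodes = len(z)
    # Each node forms its local information vector and matrix.
    u = [H[i].T @ np.linalg.solve(R[i], z[i]) for i in range(n_nodes)]
    U = [H[i].T @ np.linalg.solve(R[i], H[i]) for i in range(n_nodes)]

    x_next, P_next = [], []
    for i in range(n_nodes):
        group = [i] + list(neighbors[i])
        y = sum(u[j] for j in group)   # fused information vector
        S = sum(U[j] for j in group)   # fused information matrix
        # Measurement update in information form, using neighbors' data only.
        P_post = np.linalg.inv(np.linalg.inv(P_prior[i]) + S)
        x_post = x_prior[i] + P_post @ (y - S @ x_prior[i])
        # Time update with the shared linear dynamics.
        x_next.append(F @ x_post)
        P_next.append(F @ P_post @ F.T + Q)
    return x_next, P_next
```

No node ever needs the measurements of the whole network, so the loss of one node only removes that node's contribution from its neighbors' updates. In a fully connected network with identical priors, every node reproduces the centralized Kalman update.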

In this paper, DKF theory is introduced into a distributed microphone array network, and a DKF-based speaker tracking method for noisy and reverberant environments is proposed.

http://dx.doi.org/10.1016/j.apacoust.2014.09.004

⇑ Corresponding author.

E-mail addresses: y.tian@mail.dlut.edu.cn (Y. Tian), zhechen@dlut.edu.cn (Z. Chen), flyin@dlut.edu.cn (F. Yin).

Applied Acoustics 89 (2015) 71–77

journal homepage: www.elsevier.com/locate/apacoust
