In the α-EM algorithm, the above likelihood ratios are always used instead of the logarithmic likelihood.
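For reference, the α-logarithm on which these ratios are based is presumably the standard one in which α = −1 recovers the ordinary logarithm:

$$L^{(\alpha)}(x) = \frac{2}{1+\alpha}\left(x^{(1+\alpha)/2} - 1\right) \quad (\alpha \neq -1), \qquad L^{(-1)}(x) = \lim_{\alpha \to -1} L^{(\alpha)}(x) = \log x.$$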
3.2. Maximization of incomplete-data likelihood
The model which best explains the appearance of the observable data y satisfies the maximization condition below.
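Assuming that $p_Y(y \mid \theta)$ denotes the incomplete-data probability density, this condition presumably reads

$$\theta^{*} = \operatorname*{arg\,max}_{\theta \in \Theta} \; p_Y(y \mid \theta).$$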
Here, Θ is a Euclidean space or its subset which has the same dimension as θ and φ. This maximization is equivalent to that of the incomplete-data α-log likelihood ratio.
The α-EM algorithm given in the next section is a method for performing this maximization by using the conditional expectation of the complete-data α-log likelihood ratio. In the case of α = −1, this maximization is reduced to that of the log-likelihood. This is the traditional EM algorithm. In this paper, therefore, it is often referred to as the log-EM algorithm.
4. The α-EM Algorithm and Its Family
4.1. Basic equality for the α-EM algorithm
First, we define the expectation of the complete-data α-log likelihood ratio, conditioned on the observed data.
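Writing $p_X(x \mid \theta)$ for the complete-data density, this expectation presumably takes the form

$$Q^{(\alpha)}(\theta \mid \phi) = \int p_{X|Y}(x \mid y, \phi)\, L^{(\alpha)}\!\left(\frac{p_X(x \mid \theta)}{p_X(x \mid \phi)}\right) dx = E_{p_{X|Y}(x|y,\phi)}\!\left[L^{(\alpha)}\!\left(\frac{p_X(x \mid \theta)}{p_X(x \mid \phi)}\right)\right].$$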
This Q^(α)-function has the following property.
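Since $L^{(\alpha)}(1) = 0$, the property in question is presumably

$$Q^{(\alpha)}(\phi \mid \phi) = 0,$$

so that keeping the Q^(α)-function nonnegative under an update of φ, as required below, amounts to not decreasing it from its value at θ = φ.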
Next, we compute the α-divergence between the two conditional probability densities $p_{X|Y}(x \mid y, \theta)$ and $p_{X|Y}(x \mid y, \phi)$ by using Eq. (1). After mathematical computation using the Bayes formula, the following equation is obtained. This is the basic equality for the α-EM algorithm.
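Under the conventions assumed here, with $D^{(\alpha)}(\cdot \,\Vert\, \cdot)$ the α-divergence of Eq. (1), the basic equality presumably reads

$$L^{(\alpha)}\!\left(\frac{p_Y(y \mid \theta)}{p_Y(y \mid \phi)}\right) = Q^{(\alpha)}(\theta \mid \phi) + \left[1 + \frac{1+\alpha}{2}\, Q^{(\alpha)}(\theta \mid \phi)\right] D^{(\alpha)}\!\left(p_{X|Y}(\cdot \mid y, \phi) \,\Vert\, p_{X|Y}(\cdot \mid y, \theta)\right),$$

where the bracketed factor equals $E_{p_{X|Y}(x|y,\phi)}\big[(p_X(x \mid \theta)/p_X(x \mid \phi))^{(1+\alpha)/2}\big]$ and is therefore positive.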
It is important to observe that the second term on the right-hand side is nonnegative for α < 1. Therefore, the incomplete-data α-log likelihood ratio on the left-hand side is always nonnegative if the Q^(α)-function on the right-hand side is kept nonnegative under the update of φ. Then, the α-EM algorithm is obtained as follows.
[α-EM algorithm I]
[E-step] Compute (9) analytically. In the case of α = −1, the computation is performed using the logarithm.
[M-step] For α < 1, obtain θ* ∈ Θ which maximizes (9).
[U-step] Check to see if the algorithm has converged. If not, then replace φ by θ* and go back to the E-step.
Next, we extract the core part of the Q^(α)-function as follows.
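Given the form of Q^(α) above, this core part is presumably

$$E_{p_{X|Y}(x \mid y, \phi)}\!\left[\left(\frac{p_X(x \mid \theta)}{p_X(x \mid \phi)}\right)^{(1+\alpha)/2}\right].$$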
Since there is a relationship between the Q^(α)-function and this core part (sketched below), the α-EM algorithm is classified into the following cases.
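The relationship presumably takes the form

$$Q^{(\alpha)}(\theta \mid \phi) = \frac{2}{1+\alpha}\left( E_{p_{X|Y}(x|y,\phi)}\!\left[\left(\frac{p_X(x \mid \theta)}{p_X(x \mid \phi)}\right)^{(1+\alpha)/2}\right] - 1 \right) \quad (\alpha \neq -1),$$

so the prefactor 2/(1+α) is positive for α > −1 and negative for α < −1; this sign change explains the case distinction in the M-step below.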
[α-EM algorithm II]
[E-step] Compute (11) analytically. In the case of α = −1, the usual logarithm is used.
[M-step]
(i) For the case of −1 < α < 1, compute θ* which maximizes (11).
(ii) For the case of α = −1, compute θ* which maximizes $E_{p_{X|Y}(x|y,\phi)}[\log p_X(x \mid \theta)]$.
(iii) For the case of α < −1, compute θ* which minimizes (11).
[U-step] Check to see if the algorithm has converged. If not, then replace φ by θ* and go back to the E-step.
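As a concrete illustration of algorithm II, the following is a minimal Python sketch under assumptions not taken from the paper: a two-component unit-variance Gaussian mixture with known means and an unknown mixing weight, a grid-search M-step, and the α-logarithm convention stated earlier. It follows the E/M/U structure above, maximizing the core part (11) for −1 < α < 1, maximizing the conditional expectation of the log-likelihood at α = −1, and minimizing (11) for α < −1.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy model (an illustrative assumption): a mixture of two unit-variance
# Gaussians at known means -2 and +2; the mixing weight is the only unknown.
MU = np.array([-2.0, 2.0])

def sample(n, weight):
    z = (rng.random(n) < weight).astype(int)
    return rng.normal(MU[z], 1.0)

def responsibilities(y, phi):
    # E-step quantities: posterior probabilities p(z = k | y_i, phi).
    like = np.exp(-0.5 * (y[:, None] - MU[None, :]) ** 2)
    post = like * np.array([1.0 - phi, phi])
    return post / post.sum(axis=1, keepdims=True)

def log_core(theta, phi, post, alpha):
    # Logarithm of the core part: the conditional expectation of
    # (complete-data likelihood ratio)^((1+alpha)/2).  The Gaussian factors
    # cancel in the ratio, leaving only the mixing weights.
    ratio = np.array([(1 - theta) / (1 - phi), theta / phi])
    if alpha == -1.0:  # log-EM case (ii): use E[log ratio] directly
        return np.sum(post * np.log(ratio)[None, :])
    beta = 0.5 * (1.0 + alpha)
    return np.sum(np.log(post @ ratio**beta))

def alpha_em(y, alpha, phi=0.2, iters=100, tol=1e-8):
    grid = np.linspace(0.01, 0.99, 981)
    sign = 1.0 if alpha >= -1.0 else -1.0  # minimize the core when alpha < -1
    for _ in range(iters):
        post = responsibilities(y, phi)                      # E-step
        vals = [sign * log_core(t, phi, post, alpha) for t in grid]
        theta = grid[int(np.argmax(vals))]                   # M-step
        if abs(theta - phi) < tol:                           # U-step
            break
        phi = theta                                          # replace phi
    return phi

y = sample(500, 0.7)
for a in (-1.0, 0.0, -3.0):
    print(f"alpha = {a:+.1f}: estimated weight = {alpha_em(y, a):.3f}")
```

The grid search stands in for the analytic arg max; in models where the maximizer of (11) has no closed form, this is exactly the situation addressed by the α-GEM algorithm of Section 4.2.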
The convergence properties of the α-EM algorithm are discussed in the next section together with generalized algorithms.
4.2. The α-GEM algorithm
Dempster and colleagues [1] presented the GEM algorithm (Generalized EM algorithm), which uses incremental updates that are possibly weaker than the maximization per se. Such a GEM algorithm is important for the case in which the exact arg max cannot be obtained analytically. The α-GEM algorithm using the α-log likelihood ratio is described as follows:
[α-GEM algorithm]
[M-step] For α < 1, compute the update value θ* ∈ Θ such that condition (12) below is satisfied.
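Given the property Q^(α)(φ | φ) = 0 noted above, condition (12) is presumably

$$Q^{(\alpha)}(\theta^{*} \mid \phi) \geq 0;$$

that is, the update need only keep the Q^(α)-function nonnegative rather than maximize it.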
Since the α-EM algorithm satisfies (12), it is a special case of the α-GEM algorithm. On the convergence of the α-GEM algorithm, we have two theorems. Theorem 1 states the convergence of the incomplete-data α-log likeli-