D-EM算法：一种快速收敛的统计建模新方法

需积分: 13 35 浏览量更新于2024-08-07 收藏 304KB PDF 举报

"本文介绍了D-EM算法，这是一种新的统计算法，扩展了传统的EM算法，并具有更快的收敛速度。D-EM算法的设计参数D对Hessian矩阵的似然最大化特征值有直接影响，从而影响算法的收敛性能。作者Yasuo Matsuyama在文中提出并分析了这个算法，并给出了基本D-EM算法及其实际变体的收敛定理。数值评估显示，相比于传统EM算法，D-EM算法在减少迭代次数近三分之一和CPU时间减半的情况下仍能实现快速收敛。文章关键词包括D-EM算法、D-log似然比、收敛速度、渔民信息和Hessian矩阵。" 在统计学和机器学习领域，EM（期望最大化）算法是一种广泛使用的算法，用于估计含有未观察到的数据（即隐藏变量）的模型参数。传统的EM算法在处理含有缺失数据的问题时，通过交替执行期望（E）步骤和最大化（M）步骤来迭代优化模型参数，直到收敛。E步骤计算在当前参数估计下的期望值，而M步骤则最大化这些期望值以更新参数。 D-EM算法是EM算法的一种改进版本，它引入了一个设计参数D，这个参数可以调整算法的行为。当D等于1时，D-EM算法退化为传统的EM算法。然而，通过选择不同的D值，D-EM算法可以改变Hessian矩阵的特征值分布，这直接影响了算法的收敛速度。Hessian矩阵是梯度的二阶导数矩阵，用于描述函数在局部的曲率信息，对于优化问题的快速收敛至关重要。 Matsuyama的研究表明，D-EM算法的收敛速度显著快于传统的EM算法。这意味着在处理同样大小的数据集时，D-EM算法需要更少的迭代次数，从而节省了计算资源。这对于大规模数据集和计算效率要求高的应用来说尤其重要，因为它减少了计算时间和计算成本。此外，论文中还提到了“渔民信息”，这是用来描述模型参数不确定性的量，与Hessian矩阵有关。在优化过程中，渔民信息可以帮助评估参数估计的稳定性和精度。通过对D的合理选择，D-EM算法可以更有效地利用渔民信息，加快收敛。 D-EM算法提供了一种新的策略来优化EM算法的性能，特别是在处理大量数据和复杂模型时，其优势更为明显。通过调整D值，研究者可以平衡算法的收敛速度和模型的复杂性，以适应不同场景的应用需求。这项工作为统计推断和机器学习领域的算法优化提供了有价值的理论依据和实践指导。

In the D-EM algorithm, the above likelihood ratios are

always used instead of logarithmic likelihood.

3.2. Maximization of incomplete-data

likelihood

The model which best explains the appearance of the

observable data y satisfies

Here, < is a Euclidean space or its subset which has the

same dimension as \ and M. This maximization is equiva-

lent to that of the incomplete-data D-log likelihood ratio.

The D-EM algorithm given in the next section is a method

for performing this maximization by using the conditional

expectation of the complete-data D-log likelihood ratio. In

the case of D = 1, this maximization is reduced to that of

log-likelihood. This is the traditional EM algorithm. In this

paper, therefore, it is often referred to the log-EM algo-

rithm.

4. The D-EM Algorithm and Its Family

4.1. Basic equality for the

-EM algorithm

First, we define the expectation of the complete-data

D-log likelihood ratio conditioned by the observed data:

This Q

D

-function has the following property:

Next, we compute the D-divergence between two condi-

tional probability densities p



y, \ and p

x|y, M

by using Eq. (1). After mathematical computation using the

Bayes formula, the following equation is obtained. This is

the basic equality for the D-EM algorithm:

It is important to observe that the second term on the

right-hand side is nonnegative for D < 1. Therefore, the

incomplete-data D-log likelihood ratio of the left-hand side

is always nonnegative if the Q

D

-function of the right-hand

side is kept nonnegative on the update of M. Then, the D-EM

algorithm is obtained as follows.

[D-EM algorithm I]

[E-step] Compute (9) analytically. In the case of D =

1, the computation is performed using the logarithm.

[M-step] For D < 1, obtain \





which maximizes

(9).

[U-step] Check to see if the algorithm is converged.

If not, then replace

and go back to the E-step.

Next, we extract the core part of the Q

D

-function as

Since there is a relationship

the

-EM algorithm is classified into the following cases.

[

-EM algorithm II]

[E-step] Compute (11) mathematically. In the case of

= 1, the usual logarithm is used.

[M-step]

(i) For the case of





, compute



which

maximizes (11).

(ii) For the case of

= 1, compute



which maxi-

mizes E

[logp

]

(iii) For the case of

< 1, compute



which

minimizes (11).

[U-step] Check to see if the algorithm is converged.

If not, then replace



and go back to the E-step.

The convergence properties of the

-EM algorithm

are discussed in the next section together with generalized

algorithms.

4.2. The D-GEM algorithm

Dempster and colleagues [1] presented the GEM

algorithm (Generalized EM algorithm) which uses incre-

mental updates possibly weaker than the maximization per

se. Such a GEM algorithm is important for the case that

exact arg max cannot be obtained analytically. The D-

GEM algorithm using the D-log likelihood ratio is de-

scribed as follows:

[D-GEM algorithm]

[M-step] For D < 1, compute the update value



 < such that

Since the D-EM algorithm satisfies (12), it is a special

case of the D-GEM algorithm. On the convergence of the

D-GEM algorithm, we have two theorems. Theorem 1

states the convergence of the incomplete-data D-log likeli-

(9)

(10)

(11)

(12)

剩余11页未读，继续阅读

weixin_38715008

粉丝: 5
资源: 1016

D-EM算法：一种快速收敛的统计建模新方法

AGR-开源

The [agr]-EM algorithm and its basic properties

AGR---MD---AX1.rar_433Mhz遥控开关_433mhz_FSK遥控_fsk无线遥控_无线遥控

glibc-langpack-agr-2.28-162.el8.aarch64.rpm

glibc-langpack-agr-2.28-161.el8.aarch64.rpm

glibc-langpack-agr-2.28-155.el8.aarch64.rpm

glibc-langpack-agr-2.28-158.el8.aarch64.rpm

glibc-langpack-agr-2.28-160.el8.aarch64.rpm

glibc-langpack-agr-2.28-170.el8.aarch64.rpm

glibc-langpack-agr-2.28-167.el8.aarch64.rpm

最新资源