Attraction Domain in Gradient Optimization-based Sample Maximum Likelihood Estimation ⋆

Yiqun Zou ∗, Xiafei Tang ∗∗

∗ School of Information Science and Engineering, Central South University, Changsha, 410083 China (e-mail: yiqunzou@csu.edu.cn)
∗∗ School of Electrical and Information Engineering, Changsha University of Science and Technology, Changsha, China (e-mail: xiafei.tang@csust.edu.cn)

⋆ This work is supported by NSFC (Project 61403427).
Abstract: The sample maximum likelihood (SML) method is frequently used to identify errors-in-variables (EIV) systems. It generates the estimate by minimizing a cost function built on the mean input-output data and the sample noise variances. To help gradient-based algorithms overcome local convergence, we examine the attraction domains of the SML cost. It is shown in this paper that the asymptotic convergence properties of the objective can be studied equivalently through its noiseless version. Moreover, we present some special attraction domains that contain the global minimum under certain model structures. For these particular models, a careful initialization located in the same domain leads the algorithm to the global minimum.

Keywords: EIV system, Maximum likelihood estimation, Gradient-based optimization, Global and local convergence, Attraction domain
1. INTRODUCTION
Identification of the dynamics of errors-in-variables (EIV) systems has been examined by researchers for a long time, and many methods have been developed to handle this problem. For example, total least squares is described explicitly in Van Huffel and Vandewalle (1991). As an alternative, the basic instrumental-variable approach and its extended version, which give consistent EIV estimates asymptotically in the time domain, are designed in Söderström (1981) and Söderström and Stoica (1983). The statistical properties of the two methods are analyzed and compared in Söderström and Mahata (2002), where it is suggested that their asymptotic covariance matrices are similar in mathematical form. The Koopmans-Levin (KL) method is presented in Guidorzi (1981) to estimate the noise variance matrix based on a known variance ratio between the input and output noise. Assuming the whiteness of the input and output noise, the Frisch scheme is designed in Beghelli, Guidorzi and Soverini (1990). Both the KL method and the Frisch scheme can be seen as special forms of the bias compensation least squares (BCLS) method (Zheng, 2002). Other methods, such as the prediction error method (PEM), frequency-domain approaches, and methods based on higher-order moment statistics, are discussed separately in Pintelon et al. (1992). Söderström et al. (2010) compares the statistical accuracy of the estimates between time-domain maximum likelihood (ML) and frequency-domain sample maximum likelihood (SML). Interested readers are recommended to see Söderström (2007) for a thorough survey.
Söderström (2006) discusses the time-domain PEM and ML methods in detail. The estimation criteria for both methods are built on prediction error sequences. Compared with PEM, the ML estimator is more accurate, with a lower covariance matrix for the parameter errors. The main drawback of the PEM and ML methods in the time domain is that a Riccati equation needs to be solved at each iteration step in the derivation of the prediction error innovation, which complicates the optimization process.
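For illustration, in generic state-space notation (the matrices F, C, Q, R and the covariance P_t below are ours, not this paper's), the innovation covariance obeys a Riccati recursion of roughly the form
\[
P_{t+1} = F P_t F^{\top} + Q - F P_t C^{\top}\left(C P_t C^{\top} + R\right)^{-1} C P_t F^{\top}, \qquad \Lambda_t = C P_t C^{\top} + R,
\]
and since the state-space matrices depend on the current parameter estimate, the recursion has to be rerun after every parameter update.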
Pintelon et al. (1992) discusses frequency-domain maximum likelihood (FML) estimation for EIV models, provided that the exact (co)variances of the input and output noise are available. Schoukens et al. (1997) further transforms FML into the sample maximum likelihood (SML) method, where the means of the input-output data and the sample (co)variances replace the real measurements and exact (co)variances in FML. Both the FML and SML estimators can be developed via the minimization of the relevant costs. A gradient-related algorithm is often suggested (Pintelon and Schoukens, 2001) to achieve this goal, with a good starting point generated by, for example, the BCLS scheme.
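As a sketch only (the notation follows the general style of Schoukens et al. (1997), but the exact scaling factors are omitted and should be checked against that reference), the SML cost is roughly of the form
\[
V_{\mathrm{SML}}(\theta)=\sum_{k=1}^{F}\frac{\bigl|A(\Omega_k,\theta)\hat{Y}(k)-B(\Omega_k,\theta)\hat{U}(k)\bigr|^{2}}{|A(\Omega_k,\theta)|^{2}\hat{\sigma}_Y^{2}(k)+|B(\Omega_k,\theta)|^{2}\hat{\sigma}_U^{2}(k)-2\,\mathrm{Re}\bigl(A(\Omega_k,\theta)\,\overline{B(\Omega_k,\theta)}\,\hat{\sigma}_{YU}^{2}(k)\bigr)},
\]
where \(\hat{Y}(k)\) and \(\hat{U}(k)\) are the averaged output and input spectra at frequency \(\Omega_k\), the \(\hat{\sigma}^{2}\) terms are the corresponding sample (co)variances, and \(B(\Omega_k,\theta)/A(\Omega_k,\theta)\) is the parametrized transfer function.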
If the optimization begins with a poor initialization, the search may get stuck at a local minimum. A local minimum in this paper particularly means a ‘false’ non-global minimum in the cost landscape.
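To make the role of attraction domains concrete, the following toy Python sketch (a scalar, hand-picked non-convex cost, not the SML cost itself) runs plain gradient descent from several starting points; each start converges to whichever minimum's attraction domain contains it.

def cost(theta):
    # Toy non-convex cost with a global minimum near theta = -1.04
    # and a 'false' local minimum near theta = 0.96.
    return (theta**2 - 1.0)**2 + 0.3 * theta

def grad(theta):
    # Analytic derivative of the toy cost above.
    return 4.0 * theta * (theta**2 - 1.0) + 0.3

def gradient_descent(theta0, step=0.02, iters=2000):
    theta = theta0
    for _ in range(iters):
        theta -= step * grad(theta)
    return theta

# Starting points on either side of the ridge near theta = 0.08
# fall into different attraction domains and reach different minima.
for theta0 in (-2.0, -0.5, 0.5, 2.0):
    print(f"start {theta0:+.1f} -> converged to {gradient_descent(theta0):+.4f}")

Here the starts at -2.0 and -0.5 recover the global minimum, while 0.5 and 2.0 end in the false minimum; a good initialization (for instance, from a consistent BCLS estimate) plays exactly this role for the SML cost.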
The existence of local minima relates to many factors, for instance, the model type, the structure of the input, and the magnitude of the signal-to-noise ratio (SNR). For output-error (OE) identification, in which the noise exists only at the output, how to tackle this problem has been described in various works. Åström and Söderström (1974) points out that there is no local minimum in the cost function regardless of the input, while Söderström (1975) shows that white noise as the input signal leads to global convergence for OE models. Zou and Heath (2012) summarizes these results