低维空间下Nelder-Mead单纯形法收敛特性研究

5星 · 超过95%的资源需积分: 16 11 浏览量更新于2024-07-23 1 收藏 554KB PDF 举报

Nelder-Mead非线性单纯型方法是一种广泛应用于多维无约束优化问题的直接搜索算法，首次发表于1965年。尽管它在实际工程优化中备受青睐，但其理论性质的研究相对较少。本文主要关注该方法在低维度（一维和二维）中的收敛特性。在一维情况下，我们证明了Nelder-Mead算法对于严格凸函数确实具有收敛性。严格凸函数意味着在其定义域内，任何两点之间的局部最小值都是全局最小值，这使得算法能够在寻找最小值点上表现出稳定的行为。当优化问题只在一维时，由于问题的简单性，算法可以确保找到全局最优解。然而，进入二维后情况变得复杂。尽管在理论上存在困难，论文展示了对某些严格凸函数在二维中的有限收敛结果。这意味着算法在特定条件下能够接近或达到局部最小值，但并不保证一定能找到全局最优解。这一点与一维的情况形成对比，表明维度的增加可能影响算法的全局收敛能力。值得注意的是，McKinnon给出的一个反例揭示了一个有趣的现象：存在一个严格凸的二维函数族和一组初始条件，即使在这样的限制条件下，Nelder-Mead算法也可能收敛到非最优解。这表明算法在高维空间中的行为并非总是直观可预测的，特别是对于非平凡的函数集。目前，关于Nelder-Mead算法在二维或其他更特殊凸函数集合中的全局收敛性，尚未有明确的理论证明。这仍然是该领域的一个研究挑战，表明进一步的理论研究对于理解和改进该算法在实际应用中的性能至关重要。总结来说，本文通过具体分析和实验，揭示了Nelder-Mead算法在低维度中的收敛特性，以及在高维度中可能出现的问题和未解决的理论空白。这对于理解和使用该算法的用户来说，提供了重要的理论参考，同时也为算法的改进和扩展提出了新的思考方向。

118 J. C. LAGARIAS, J. A. REEDS, M. H. WRIGHT, AND P. E. WRIGHT

(k+1)

= f

(k)

and x

(k+1)

= x

(k)

,j<k

∗

;

(k+1)

∗

(k)

∗

and x

(k+1)

∗

6= x

(k)

∗

;(2.9)

(k+1)

= f

(k)

j−1

and x

(k+1)

= x

(k)

j−1

,j>k

∗

Thus the vector (f

(k)

,...,f

(k)

n+1

) strictly lexicographically decreases at each nonshrink

iteration.

For illustration, suppose that n = 4 and the vertex function values at a nonshrink

iteration k are (1, 2, 2, 3, 3). If f(v

(k)

) = 2, the function values at iteration k + 1 are

(1, 2, 2, 2, 3), x

(k+1)

= v

(k)

, and k

∗

= 4. This example shows that, following a single

nonshrink iteration, the worst function value need not strictly decrease; however, the

worst function value must strictly decrease after at most n + 1 consecutive nonshrink

iterations.

2.2. Matrix notation. It is convenient to use matrix notation to describe

Nelder–Mead iterations. The simplex ∆

can be represented as an n ×(n + 1) matrix

whose columns are the vertices

∆



(k)

··· x

(k)

n+1





(k)

n+1



, where B



(k)

··· x

(k)



For any simplex ∆

in R

, we deﬁne M

as the n × n matrix whose jth column

represents the “edge” of ∆

between x

(k)

and x

(k)

n+1

≡



(k)

− x

(k)

n+1

(k)

− x

(k)

n+1

··· x

(k)

− x

(k)

n+1



= B

− x

(k)

n+1

,(2.10)

where e =(1, 1,...,1)

. The n-dimensional volume of ∆

is given by

vol(∆

|det(M

.(2.11)

A simplex ∆

is nondegenerate if M

is nonsingular or, equivalently, if vol(∆

) > 0.

The volume of the simplex obviously depends only on the coordinates of the vertices,

not on their ordering. For future reference, we deﬁne the diameter of ∆

diam(∆

) = max

i6=j

(k)

− x

(k)

where k·k denotes the two-norm.

During a nonshrink iteration, the function is evaluated only at trial points of the

form

(k)

(τ):=

(k)

+ τ (

(k)

− x

(k)

n+1

)=(1+τ )

(k)

− τ x

(k)

n+1

,(2.12)

where the coeﬃcient τ has one of four possible values:

τ = ρ (reﬂection); τ = ρχ (expansion);(2.13)

τ = ργ (outside contraction); τ = −γ (inside contraction).

In a nonshrink step, the single accepted point is one of the trial points, and we let

denote the coeﬃcient associated with the accepted point at iteration k. Thus the

new vertex v

(k)

produced during iteration k, which will replace x

(k)

n+1

, is given by

(k)

= z

(k)

(τ

). We sometimes call τ

the type of move for a nonshrink iteration k.

Downloaded 07/06/14 to 222.195.132.86. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

PROPERTIES OF NELDER–MEAD 119

During the kth Nelder–Mead iteration, (2.12) shows that each trial point (reﬂection,

expansion, contraction) may be written as

(k)

(τ)=∆

t(τ), where t(τ)=



1+τ

, ...,

1+τ

, −τ



.(2.14)

Following the kth Nelder–Mead iteration, the (unordered) vertices of the next

simplex are the columns of ∆

, where S

is an (n +1)× (n + 1) matrix given by





(1 + τ

)

−τ





for a step of type τ and by



1(1− σ)e

0 σI



for a shrink step, with 0 being an n-dimensional zero column and I

being the n-

dimensional identity matrix. After being ordered at the start of iteration k + 1, the

vertices of ∆

k+1

satisfy

∆

k+1

=∆

, with T

= S

,(2.15)

where P

is a permutation matrix chosen to enforce the ordering and tie-breaking

rules (so that P

depends on the function values at the vertices).

The updated simplex ∆

k+1

has a disjoint interior from ∆

for a reﬂection, an

expansion, or an outside contraction, while ∆

k+1

⊆ ∆

for an inside contraction or a

shrink.

By the shape of a nondegenerate simplex, we mean its equivalence class under

similarity, i.e., ∆ and λ∆ have the same shape when λ>0. The shape of a simplex

is determined by its angles, or equivalently by the singular values of the associated

matrix M (2.10) after scaling so that ∆ has unit volume. The Nelder–Mead method

was deliberately designed with the idea that the simplex shapes would “adapt to the

features of the local landscape” [6]. The Nelder–Mead moves apparently permit any

simplex shape to be approximated—in particular, arbitrarily ﬂat or needle-shaped

simplices (as in the McKinnon examples [5]) are possible.

3. Properties of the Nelder–Mead algorithm. This section establishes var-

ious basic properties of the Nelder–Mead method. Although there is a substantial

level of folklore about the Nelder–Mead method, almost no proofs have appeared in

print, so we include details here.

3.1. General results. The following properties follow immediately from the

deﬁnition of Algorithm NM.

1. A Nelder–Mead iteration requires one function evaluation when the iteration

terminates in step 2, two function evaluations when termination occurs in step 3 or

step 4, and n + 2 function evaluations if a shrink step occurs.

2. The “reﬂect” step is so named because the reﬂection point x

(2.4) is a

(scaled) reﬂection of the worst point x

n+1

around the point

x on the line through

n+1

and

x. It is a genuine reﬂection on this line when ρ = 1, which is the standard

choice for the reﬂection coeﬃcient.

3. For general functions, a shrink step can conceivably lead to an increase in

every vertex function value except f

, i.e., it is possible that f

(k+1)

(k)

for 2 ≤ i ≤

n + 1. In addition, observe that with an outside contraction (case 4a), the algorithm

takes a shrink step if f(x

) >f(x

), even though a new point x

has already been

found that strictly improves over the worst vertex, since f(x

) <f(x

n+1

Downloaded 07/06/14 to 222.195.132.86. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

剩余35页未读，继续阅读

wangxiaomin88

粉丝: 16

低维空间下Nelder-Mead单纯形法收敛特性研究

nelder-mead算法

nelder-mead.zip_The Various_nelder mead matlab

On Local and Global Convergence of a Nonsmooth Newton-type Method for Nonlinear Semidefinite Programs

Convergence Analysis on a Derivative-Free Descent Method for Nonlinear Complementarity Problems

The convergence guarantees of a non-convex approach for sparse recovery using regularized least squares

Convergence Of Peer-To-Peer And Grid Computing

Convergence rate of the Asymmetric Deuant-Weisbuch Dynamics

Strong Convergence Properties of Jamison Weighted Sums of ruo{~}-Mixing Random Sequences

Convergence analysis of an adaptive finite element method for the elasto-plastic torsion problem

Convergence Rates of the Distributions of Error Variance Estimates in Linear Models" (1983年)

最新资源