自适应赛车排名免疫优化法解决多目标期望值规划

需积分: 8 136 浏览量更新于2024-07-15 收藏 880KB PDF 举报

"这篇研究论文提出了一种基于自适应赛车排名的免疫优化方法，用于解决非线性多目标期望值编程问题。该方法无需预先了解噪声分布，通过建立下界估计值来限制随机变量的样本大小，采用自适应赛车排名策略识别有价值个体，并基于新的聚合度模型构建免疫优化算法寻找ε-帕累托最优解。实验证明，这种方法在解决此类问题时具有竞争力。" 本文主要探讨了如何利用生物启发式方法，特别是免疫优化算法，解决多目标期望值规划的问题。多目标期望值编程通常涉及处理多个相互冲突的目标函数，这在实际工程和决策问题中非常常见。传统的优化方法可能无法有效地处理这种复杂性，尤其是在面临非线性和不确定性的情况下。首先，研究者引入了一个有用的下界估计，这一创新点在于它能够限制随机变量的样本大小，从而降低计算复杂性和提高算法的收敛性。这在处理含有随机因素的多目标问题时显得尤为重要，因为它允许算法在不完全了解噪声分布的情况下进行优化。接着，他们设计了一种自适应赛车排名方案。这种策略借鉴了赛车进化算法的概念，通过对当前种群中的个体进行比较和筛选，找出那些具有优良性能的个体，即“有价值的个体”。这些优质个体将获得更多的采样机会和更高的重要性权重，从而加速算法对全局最优解的探索。随后，论文提出了一个基于免疫的优化方法，该方法依赖于一个新的聚合度模型。在ε-帕累托最优框架下，该模型帮助算法找到一组接近帕累托前沿的解集，这些解代表了各种可能的权衡，而不仅仅是单一的全局最优解。ε-帕累托最优解是指那些在所有目标上都优于或等价于其他解的解，允许一定程度的偏离帕累托最优状态。最后，通过与现有优化技术的比较实验，论文证明了所提方法在效率和效果上的优势。实验结果表明，这种自适应赛车排名的免疫优化方法能够在解决多目标期望值规划问题时展现出强大的竞争力，是此类问题的一个有力求解工具。这篇研究为解决复杂、非线性多目标问题提供了新的视角和方法，特别是在处理不确定性和噪声方面展示了其独特的优势。这对于优化理论研究以及实际应用领域，如工程设计、经济决策和系统控制等，都具有重要的参考价值。

2142 K. Yang et al.

depend greatly on sufﬁciently large and ﬁxed sample sizes.

For instance, Drugan and Nowe (2013) developed a multi-

objective approach to solve the problem of multi-objective

multi-armed bandits in the ﬁeld of reinforcement learning,

based on a standard upper conﬁdence bound approach and a

Pareto dominance order relationship. Unfortunately, despite

being capable of solving the multi-objective problem directly,

such approach causes low performance efﬁciency. Addition-

ally, we have also investigated two immune-inspired multi-

objective optimization approaches to solve MEVP (Zhang

and Tu 2007b) and multi-objective chance-constrained pro-

gramming (Zhang et al. 2013b). They involve in several

design inspirations such as immune suppression, immune

selection, aging and probabilistic dominance, in which a sam-

ple allocation technique is designed to assign large sample

sizes to high-quality individuals.

3 Problem statement and preliminaries

Consider the following multi-objective expected value pro-

gramming problem of the form:

MEVP min

x∈D

f (x) = E [ f

(x, ξ ), f

(x, ξ ),..., f

(x, ξ )],

with bounded and closed domain D in R

, decision vec-

tor x in D, where ξ is a r -dimensional random vector with

unknown distribution; E [ f

(x, ξ ), f

(x, ξ )..., f

(x, ξ )]

denotes the expected value vector function

(E [ f

(x, ξ )], E [ f

(x, ξ )],...,E [ f

(x, ξ )]); E [.] is the

operator of expectation; f

(x, ξ ) is the jth nonlinear stochas-

tic subobjective function. In order to seek MEVP’s solutions,

the concept of ε-dominance (Batista et al. 2011) is usually

picked up to execute solution comparison. In other words,

for two given candidates x, y ∈ D we say that x ε-dominates

y(x ≺

y),if

E[ f

(x, ξ )]+ε ≤ E[ f

(y, ξ )], (1)

with 1 ≤ j ≤ q, and there exists k satisfying

E[ f

(x, ξ )]+ε<E[ f

(y, ξ )]. (2)

This way, x

∗

∈ D is called an ε-Pareto optimal solution, if

there is no candidate z∈ D such that z ≺

∗

. In particu-

lar, for a given ﬁnite population A, x ∈ A is said to be

an ε-nondominated individual, if there is no individual y

in A such that y ε-dominates x. Similarly, the concept of

ε-dominance above may be naturally extended into the ver-

sion of β-dominance (Trautmann et al. 2009; Eskandari and

Geiger 2009) which will be used in identifying competi-

tive individuals. In other words, x β-dominates y with given

β = (β

,β

,...,β

) and β

> 0, if inequalities (1) and (2)

are true after replacing ε by β

and β

, respectively. Corre-

spondingly, x ∈ A is said to be a β-nondominated individual,

if there is no individual y in A such that y β-dominates x.

Generally, when ξ is with known distribution F

(z), each

of the above subobjective functions can be replaced by

E [ f

(x, ξ )]=



z∈R

(x,z)dF

(z), (3)

and hence the above MEVP can be changed into an analyt-

ically deterministic multi-objective programming problem.

However, in many practical problems, the noisy information

of ξ is unknown and accordingly, the model approximation

handling method is an alternative way to solve such kind

of problem. Sample average approximation is a simple and

popular method used in coping with expected value program-

ming problems with unknown noise distributions. Therefore,

we use it to transform the above problem into the following

multi-objective sample average approximation model:

SAA min

x∈D

f (x) =



(x),

(x),...,

(x)



s.t.,

(x) =



i=1

(x, ξ

with 1 ≤ j ≤ q; m denotes the ﬁxed sampling size; ξ

1 ≤ i ≤ m,arethei.i.d samples of ξ . x is said to empirically

ε-dominate y (simply say x ≺

ˆε

y) if satisfying

(x) + ε ≤

(y) with 1 ≤ j ≤ q and

(x) + ε<

(y) for some k.

Similar to the version of ε-orβ-dominance above, x is called

an ε-orβ-empirical nondominated individual if there does

not exist any individual y in A such that y ε-orβ-dominates

x empirically.

We easily know that the set of solutions for the above

problem SAA can approach that of the MEVP above when

m is sufﬁciently large according to the law of large number.

However, in such case any optimization method will cause

expensive computational cost. Consequently, we require that

different candidates be attached different sample sizes so as

to reduce the cost of computation. Hence, the above SAA

is transformed into the following multi-objective sample-

dependent approximation (SDA) model:

SDA min

x∈D

f (x) =



(x),

(x), . . . ,

(x)



s.t.,

(x) =

m(x)



k=1

(x, ξ

), 1 ≤ j ≤ q,

where m(x) is the sample size of ξ at the point x. We next cite

the following conclusions to help us design a novel racing

ranking approach to be used in deciding those competitive

individuals in a given population.

123

剩余19页未读，继续阅读

weixin_38518638

粉丝: 3

自适应赛车排名免疫优化法解决多目标期望值规划

微免疫优化算法求解约束期望值规划：竞赛采样策略

动态免疫优化算法：自适应学习与多目标问题求解

基于自适应微分分组的大规模全局优化问题求解

基于自适应变异粒子群优化算法的移动机器人路径规划.pdf

论文研究-基于自适应多态免疫蚁群算法的TSP求解.pdf

基于自适应加权的多学科多目标设计优化 (2011年)

【优化求解】基于自适应权重和Levy飞行的改进鲸鱼优化算法matlab源码.md

基于自适应观测器的故障诊断与容错控制技术：推导过程LMI求解及Simulink仿真分析,基于自适应观测器的故障诊断与容错控制技术：推导过程LMI求解及Simulink仿真研究,基于自适应观测器的故障诊

基于自适应遗传算法的TSP问题建模求解（Java）

【优化求解】基于自适应t分布的麻雀搜索算法matlab源码.md

最新资源