自适应动态规划在非线性系统镇定中的应用：应对未知执行器饱和度

需积分: 5 145 浏览量更新于2024-07-14 1 收藏 1.04MB PDF 举报

"这篇研究论文探讨了如何使用自适应动态规划算法解决执行器饱和度未知的非线性系统的镇定问题。控制策略包括一个非线性的名义最优控制器和基于神经网络（NN）的前馈饱和补偿器。对于没有执行器饱和的名义系统，通过建立一个批评型神经网络来处理汉密尔顿-雅各比-贝尔曼方程，从而在线获得近似名义最优控制策略。然后，通过使用基于神经网络的前馈控制环路来补偿被认为是饱和非线性的未知执行器饱和。论文证明了这种方法的稳定性，并在非线性动力学系统中进行了应用。" 这篇研究论文的核心内容集中在解决一类特殊的控制系统问题，即那些由于执行器饱和导致性能受限的非线性系统。执行器饱和是指控制系统中的执行机构达到其物理限制，无法再进一步改变输出，这在实际工程系统中是常见的问题。论文提出的解决方案基于自适应动态规划（Adaptive Dynamic Programming, ADP），这是一种优化控制策略的方法，它可以在系统运行过程中逐步学习和改进控制策略。 ADP通常包括两个主要部分：一是评价函数，它衡量系统状态的优劣，对应于解决的汉密尔顿-雅各比-贝尔曼（Hamilton-Jacobi-Bellman, HJB）方程；二是控制策略，它决定了系统的下一步行动。在本研究中，对于没有执行器饱和的系统，通过批评型神经网络（Critic NN）解决了HJB方程，从而得到近似的最优控制策略。然而，当考虑执行器饱和时，问题变得更加复杂。论文提出使用一个基于神经网络的前馈控制环路来补偿这种饱和非线性。这个NN模型能够学习和预测执行器的饱和行为，通过前馈控制信号来抵消饱和的影响，使得系统能够在饱和约束下保持稳定。论文的重点在于设计和分析这种控制策略的稳定性。作者们证明了在考虑未知执行器饱和的情况下，所提出的控制策略能够保证系统的全局渐近稳定性。这意味着，尽管存在不确定性，系统的状态将随着时间推移趋向于一个稳定的平衡点。这项研究为解决执行器饱和的非线性系统提供了一个新颖且实用的控制框架，利用自适应动态规划和神经网络技术克服了执行器饱和带来的挑战。这一方法对实际工程中的控制系统设计具有重要的理论和实践意义，特别是在那些执行器性能受限的复杂非线性系统中。

Adaptive dynamic programming-based stabilization 2091

able actuator saturation, this paper presents the

ADP-based control methods for nonlinear systems

subject to unknown actuator saturation. Thus, the

developed control method avoids any priori knowl-

edge of actuator saturation.

3. The optimal control is derived depending only on

critic NN, rather than dual- or triple-NN-based

architecture. Thus, it reduces the computational

burden of traditional adaptive critic designs [43,

49].

The structure of this paper is organized as follows:

In Sect. 2, the problem statement is provided. In Sect. 3,

the ADP-based online nominal optimal control is devel-

oped for nominal nonlinear systems. Then, a NN-based

saturation compensator is developed for eliminating

the negative affection of unknown actuator saturation.

In the following, the stability analysis is presented. In

Sect. 4, two numerical examples are employed to verify

the effectiveness of the proposed method. Finally, the

conclusion is drawn in Sect. 5.

2 Problem statement

The considered nominal continuous-time nonlinear

systems can be described as

˙x = f (x) + g(x)u, (1)

where x ∈ R

and u ∈ R

are the system state and

control input vectors, respectively. f (·) and g(·) are

assumed to be locally Lipschitz and differentiable in

their arguments such that the solution x(t ) to nonlinear

system (1) is unique for any given initial state x(0) = x

with f (0) = 0. Nonlinear system (1) is stable in the

sense that there exists a continuous control u which

stabilizes the system asymptotically.

In order to better adapt practical control require-

ments, we are concerned with the stabilization prob-

lems for continuous-time nonlinear systems subject to

unknown actuator saturation as

˙x = f (x) + g(x)τ, (2)

where τ =[τ

,τ

,...,τ

]

∈ R

is the saturated

actuator output vector, which is the actual applied con-

trol input of (2). It slopes between its lower and upper

limits, i.e.,

= sat(u

) =

⎧

⎨

⎩

i max

, u

> u

i max

, u

i min

≤ u

i max

i min

, u

< u

i min

(3)

where i = 1, 2,...,m, and u

i max

and u

i min

are the

unknown upper and lower limit bounds, respectively.

That is to say, actuator saturation occurs if the com-

manded input u

falls outside of the set [u

i min

, u

i max

and the control input cannot be implemented to the

device totally.

The main purpose of this paper is to propose a

NN compensation-based ADP stabilization scheme for

nonlinear systems subject to unknown actuator satura-

tion and ensure all the signals of the closed-loop non-

linear system (2) to be ultimately uniformly bounded

(UUB).

3 Online approximate optimal controller design

and stability analysis

This section is divided into three parts. The online

learning nominal optimal control scheme is presented

in the ﬁrst part for nominal system (1). Then, in the sec-

ond part, a feed-forward NN compensator is developed

to tackle the unknown actuator saturation for nonlinear

system (2). In the third part, the UUB stability of the

closed-loop nonlinear system is analyzed.

3.1 Online nominal optimal control

For nominal nonlinear system (1), a feedback control

(x) ∈ Ψ(Ω) will be derived to tackle its control

problem such that the closed-loop nonlinear system is

stable. The objective of this optimal control problem is

to ﬁnd the stabilizing nominal control u

(x) to mini-

mize the inﬁnite-horizon cost function which is given

V (x

) =



∞

U (x(s), u

(s))ds, (4)

where U (x , u

) = x

Qx + u

is the utility func-

tion, U (x, u

) ≥ 0 for all x and u

with U (0, 0) = 0,

and Q ∈ R

n×n

and R ∈ R

m×m

are positive deﬁnite

matrices. If the associated inﬁnite-horizon cost func-

tion (4) is continuously differentiable, the inﬁnitesimal

123

剩余14页未读，继续阅读

weixin_38658405

粉丝: 4
资源: 1010

自适应动态规划在非线性系统镇定中的应用：应对未知执行器饱和度

基于命令滤波的输入饱和感应电动机随机非线性系统的自适应模糊控制

不确定饱和非线性系统的截断自适应神经动态表面控制

具有饱和非线性和L 2扰动的时滞系统的自适应容错控制

给我提供一个具有输入饱和的三阶严格反馈非线性系统自适应动态面控制的matlab代码

具有饱和限幅特性非线性系统的m文件仿真代码

如何使用描述函数法分析具有饱和特性的非线性系统的稳定性？请结合相平面分析给出步骤和解释。

如何应用描述函数法与相平面分析相结合的方式，来评估具有饱和特性的非线性系统的稳定性？请详细阐述分析步骤。

具有输入饱和的不确定非线性系统matlab

非线性volterra均衡算法

python自适应调整色彩饱和度

最新资源