基于神经网络的未知非线性系统有限时间最优控制方法

26 浏览量更新于2024-08-29 收藏 484KB PDF 举报

本文主要探讨了一种基于神经网络的有限时间最优控制方法，针对一类未知非线性系统的控制问题。该研究发表在2014年的《SoftComput》期刊上，具有显著的学术价值，其DOI为10.1007/s00500-013-1170-z。论文的创新之处在于提出了一种结合自适应动态编程（ADP）算法的新型策略，这种方法利用单隐藏层前馈神经网络（SLFN）与极端学习机（ELM）技术来构建系统动力学的数据驱动辨识器。通过这种方式，即使对系统内部结构不完全了解，也能根据输入-输出数据有效地估计和控制系统的动态行为。在该方法中，两个SLFN分别用于ADP过程中的性能指标函数和最优控制律的逼近。这两个SLFN的使用有助于简化迭代过程，并确保每个迭代步骤都能得到更精确的优化结果。作者采用了一种迭代的方式，通过不断的训练和优化，SLFN能够逐步逼近系统的实际性能，从而实现有限时间内的最优控制。此外，论文还强调了在实际应用中的有效性，通过一个具体的仿真例子来展示这种基于神经网络的控制策略的有效性和优越性。该示例可能包括非线性系统的模型、控制目标设定、SLFN的训练过程、以及控制结果对比分析，以证明新方法在提升控制效率和系统稳定性方面的优势。关键词包括"自适应动态编程", "神经网络", "有限时间最优控制", "未知非线性系统", 和"极端学习机"，这些关键词揭示了研究的核心内容和技术手段。这篇研究论文提供了一个新颖的解决方案，将神经网络技术与ADP融合，适用于处理复杂的未知非线性系统控制问题，对于理论研究和实际工程应用都有很高的实用价值。通过本文的研究，可以进一步推动非线性系统控制领域的理论发展和技术进步。

Soft Comput (2014) 18:1645–1653

DOI 10.1007/s00500-013-1170-z

METHODOLOGIES AND APPLICATION

Neural-network-based approach to ﬁnite-time optimal control

for a class of unknown nonlinear systems

Ruizhuo Song · Wendong Xiao · Qinglai Wei ·

Changyin Sun

Published online: 10 November 2013

Abstract This paper proposes a novel ﬁnite-time optimal

control method based on input–output data for unknown non-

linear systems using adaptive dynamic programming (ADP)

algorithm. In this method, the single-hidden layer feed-

forward network (SLFN) with extreme learning machine

(ELM) is used to construct the data-based identiﬁer of the

unknown system dynamics. Based on the data-based iden-

tiﬁer, the ﬁnite-time optimal control method is established

by ADP algorithm. Two other SLFNs with ELM are used in

ADP method to facilitate the implementation of the iterative

algorithm, which aim to approximate the performance index

function and the optimal control law at each iteration, respec-

tively. A simulation example is provided to demonstrate the

effectiveness of the proposed control s cheme.

Keywords Adaptive dynamic programming ·

Approximate dynamic programming · Unknown nonlinear

systems · Optimal control · Data-based

1 Introduction

The linear optimal control problem with a quadratic cost

function is probably the most well-known control problem

(Duncan et al. 1999; Gabasov et al. 2000), and it can be trans-

Communicated by D. Liu.

R. Song · W. Xiao · C. Sun

School of Automation and Electrical Engineering, University

of Science and Technology Beijing, Beijing 100083, China

Q. Wei (

)

The State Key Laboratory of Management and Control for Complex

Systems, Institute of Automation, Chinese Academy of Sciences,

Beijing 100190, China

e-mail: rzsong@126.com

lated into Riccati equation. While the optimal control of non-

linear systems is usually a challenging and difﬁcult problem

(Jin et al. 2012; Zhang et al. 2011e). Furthermore, comparing

with the known system dynamics case, it is more intractable

to solve the optimal control problem of the unknown sys-

tem dynamics. Generally speaking, most actual systems are

nearly far too complex to present the perfect mathemati-

cal models. Whenever no model is available t o design the

system controller nor is easy to produce, a standard way is

resorting to data-based techniques (Guardabassi and Savaresi

2000): (1) on the basis of input-output data, the model of the

unknown system dynamics is identiﬁed; (2) on the basis of

the estimated model of the system dynamics, the controller

is designed by model-based design techniques.

It is well known that neural network is an effective tool to

implement intelligent identiﬁcation based on input–output

data, due to the properties of nonlinearity, adaptivity, self-

learning and fault tolerance (Jagannathan 2006; Yu 2009;

Fernández-Navarro et al. 2013; Richert et al. 2013; Maji et

al. 2013). In which, single-hidden-layer feed-forward neural

network (SLFN) is one of the most useful types (Huang et

al. 2006b). Hornik (1991) proved that if the activation func-

tion is continuous, bounded, and non-constant, then continu-

ous mappings can be approximated by SLFNs with additive

hidden nodes over compact input sets. Leshno et al. (1993)

improved the results of Hornik (1991) and proved that SLFNs

with additive hidden nodes and with a non-polynomial acti-

vation function can approximate any continuous target func-

tions. In Huang et al. (2006b) it is proven in theory that SLFNs

with randomly generated additive and a broad type of acti-

vation functions can universally approximate any continu-

ous target functions in any compact subset of the Euclidean

space. For SLFN training, there are three main approaches:

(1) gradient-descent based, for example back-propagation

(BP) method (Zhang et al. 2008); (2) least square based, for

123

下载后可阅读完整内容，剩余8页未读，立即下载

weixin_38587130

粉丝: 4
资源: 937

基于神经网络的未知非线性系统有限时间最优控制方法

LSTM-Neural-Network-for-Time-Series-Prediction-master.rar

A-Neural-Network-Approach-to-Nonlinear-Model-Pred_model predicti

Fuzzy Control Systems

Application of MATLAB Linear Programming in Energy Systems

Application of MATLAB in Environmental Sciences: Case Analysis and Exploration of Optimization ...

Solving Differential Equations with ode45: A Treasure in Data Science and Machine Learning, ...

【Fundamentals】Detailed Explanation of MATLAB Toolbox: Optimization Toolbox

【Fundamentals】 Detailed Explanation of Gradient Descent Algorithm and MATLAB Code

【Advanced Tutorial on MATLAB Signal Processing】: Design of Adaptive Filters

MATLAB Genetic Algorithm vs. Conventional Methods: A Deep Comparative Study Revealing Advantages

最新资源