Fluctuation-Driven Learning Rule for Continuous-Time
Recurrent Neural Networks and Its Application to Dynamical
System Control
Kazuhisa Watanabe, Takahiro Haba, Noboru Kudo, and Takahumi Oohori
Faculty of Engineering, Hokkaido Institute of Technology, Sapporo, 006-8585 Japan
SUMMARY
A fluctuation-driven learning rule is proposed for continuous-time recurrent neural networks. Random fluctuations n_j(p, t) (j: neuron number, p: input pattern number, 0 ≤ t ≤ T_p, T_p: pattern length) are superimposed on every neuron's threshold. The probability density N_j(n_j) of the fluctuation amplitude is treated as time-invariant, and an auxiliary function g_j(n_j), defined by dN_j/dn_j = g_j N_j, is introduced. Because of the fluctuations n_j(p, t), the neuron outputs r_j(p, t) and the instantaneous error e(p, t) are probabilistic quantities. The learning rule for the synaptic weight w_ji from the i-th neuron is

R_ji(p, t) = ∫_0^t g_j r_i dτ / τ_j,   Δ_p w_ji = μ ∫_0^{T_p} e R_ji dt / T_p

(τ_j: time constant of the membrane potential, μ: learning coefficient). It is shown theoretically that the expected mean error ⟨∫_0^{T_p} e dt⟩ / T_p can be minimized by steepest descent. This learning rule does not require any additional apparatus such as an adjoint system or a sensitivity system, and can be executed in the time-forward direction by simple integration, which distinguishes it from previous algorithms. The features of the proposed method are confirmed through numerical experiments with a JK flip-flop, a dynamical-system inverse model, and speed control of a moving object. © 2001 Scripta Technica, Syst Comp Jpn, 32(3): 14–23, 2001
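As a rough illustration of the rule, the sketch below applies it to a small continuous-time RNN. Everything here beyond the two equations is an assumption for illustration: the fluctuations are taken as Gaussian with standard deviation σ (so that g_j(n_j) = (dN_j/dn_j)/N_j = −n_j/σ²), the network dynamics, sigmoid output function, error definition, and all parameter values are my own choices, and the integrals are discretized by a simple Euler step.

```python
import numpy as np

# Minimal sketch of the fluctuation-driven learning rule for a small
# continuous-time RNN. Gaussian threshold fluctuations are assumed, so the
# auxiliary function is g_j(n_j) = (dN_j/dn_j)/N_j = -n_j / sigma**2.
# The network model, target, and parameter values are illustrative only.

rng = np.random.default_rng(0)
J = 3            # number of neurons
tau = 1.0        # membrane time constant tau_j (shared by all neurons here)
sigma = 0.1      # standard deviation of the threshold fluctuation n_j
mu = 0.5         # learning coefficient
dt, T = 0.01, 5.0
steps = int(T / dt)

w = rng.normal(0.0, 0.3, (J, J))   # synaptic weights w_ji
theta = np.zeros(J)                # thresholds
target = 0.6                       # desired output of neuron 0 (illustrative)

def f(y):                          # sigmoid output function
    return 1.0 / (1.0 + np.exp(-y))

errors = []                        # mean error per presentation
for epoch in range(200):
    y = np.zeros(J)                # membrane potentials
    R = np.zeros((J, J))           # eligibility traces R_ji(t)
    grad = np.zeros((J, J))        # accumulates e(t) * R_ji(t) over time
    err_sum = 0.0
    for _ in range(steps):
        n = rng.normal(0.0, sigma, J)     # fluctuations n_j(t)
        r = f(y)
        # Continuous-time RNN dynamics (Euler step):
        #   tau * dy_j/dt = -y_j + sum_i w_ji r_i + theta_j + n_j
        y += dt / tau * (-y + w @ r + theta + n)
        g = -n / sigma**2                 # auxiliary function g_j
        # R_ji(t) = (1/tau_j) * integral_0^t g_j r_i dtau
        R += dt / tau * np.outer(g, r)
        e = 0.5 * (r[0] - target) ** 2    # instantaneous error e(t)
        err_sum += e * dt
        grad += e * R * dt
    # Delta_p w_ji = mu * (1/T_p) * integral_0^{T_p} e R_ji dt
    # (with Gaussian fluctuations, e*R_ji correlates with the negative
    #  error gradient, so no explicit minus sign is needed)
    w += mu * grad / T
    errors.append(err_sum / T)
```

Note that the update runs purely in the time-forward direction: only the trace R_ji and the running error integral are carried along, with no adjoint or sensitivity system.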
Key words: Fluctuation-driven learning; recurrent
neural network; JK-FF; dynamical inverse model; speed
control.
1. Introduction
Continuous-time recurrent neural networks (RNNs), in which each neuron's state is given by a differential equation, exhibit nonlinear dynamics with respect to variations of their internal states. Such networks constitute the most universal neural circuit model and are considered promising in fields such as signal processing with complicated hysteresis effects and robot control.
Doya and Yoshizawa [1], Pearlmutter [2], and Sato
[3], almost at the same time, proposed learning algorithms
for continuous-time RNN. Analysis of learning properties
was conducted by Tokoi and colleagues [4] and Nakajima
and Ueda [5]. Doya and Yoshizawa [6], Sato and colleagues
[7], Lee and colleagues [8], and Adachi and Kotani [9]
studied recall learning of spatiotemporal patterns and predictive learning; the applicability of such algorithms to spatiotemporal data processing is being verified from the standpoint of computation theory.
However, all the aforementioned algorithms [1–3] were developed on the same assumptions as the error back-propagation algorithm for nonrecurrent (feed-forward) neural networks (FNNs), namely: (1) the teaching waveforms for each visible neuron are given explicitly, and (2) the output functions of all neurons are differentiable.
Such prerequisites restrict the universality of continuous-time RNNs. For example, in the case of control learning of a dynamical system with unknown characteristics, learning-support subsystems such as forward models [10] are
Systems and Computers in Japan, Vol. 32, No. 3, 2001
Translated from Denshi Joho Tsushin Gakkai Ronbunshi, Vol. J83-D-II, No. 3, March 2000, pp. 1034–1042