factor because it determines the size of the solution, and hence the size of the space in which the
solution can be found. Searching in too large a space may be intractable; in fact, if the solution
contains many dimensions, even searching in the space of the actual solution may be intractable.
On the other hand, searching in too small a space may fail because the solution may not exist in that
space.
Determining the right topology is also important because many common structures that
neural networks may need to represent and process are defined by an indefinite number of parameters.
For example, the number of parts in electronic circuits and robot controllers can vary (Miller
et al. 2000a; Stanley and Miikkulainen 2004). Moreover, although theoretically two neural networks
with different numbers of connections and nodes can represent the same function (Cybenko
1989), they may be neither equally efficient to run nor equally easy to discover. Thus, it is not clear
what network topology is appropriate for solving a particular problem. Methods that search in fixed
spaces must rely on heuristics to determine the appropriate topology a priori.
In neuroevolution, the topology is defined by the network’s genetic encoding, and the size
of the encoding, i.e., the number of genes, is a crucial factor determining the network topology. In
highly complex domains the heuristics for determining the appropriate size are not very useful, and
it becomes increasingly difficult to solve such domains with fixed-length encodings. For example,
how many nodes and connections are necessary for a neural network that controls a robotic maid?
The answers to such questions can hardly be based on empirical experience or analytic methods,
since little is known about the solutions. One possible approach is simply to make the genetic
encoding extremely large, so that the space it encodes is vast and a solution is likely to lie
somewhere within it. Yet the larger the encoding, the higher-dimensional the space that evolution
needs to search. Even if a robotic maid lies somewhere in the 10,000-dimensional space of a
10,000-gene encoding, searching such a space may take prohibitively long.
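A back-of-the-envelope sketch makes this point concrete. Suppose, purely for illustration (the tolerance value below is an arbitrary assumption, not a figure from the text), that a solution requires each gene to land within ±0.1 of its target value in a unit interval; the fraction of the space containing solutions then shrinks exponentially with the number of genes:

```python
# Illustrative sketch: fraction of a d-dimensional unit hypercube in which
# every coordinate lies within +/-0.1 of a target value. The tolerance 0.1
# is an arbitrary assumption chosen only to show the exponential trend.
def hit_fraction(d, tol=0.1):
    # Each gene independently lands close enough with probability 2*tol,
    # so all d genes succeed with probability (2*tol)**d.
    return (2 * tol) ** d

for d in (1, 10, 100, 10_000):
    print(f"{d:>6} genes -> solution fraction {hit_fraction(d):.3e}")
```

At 10,000 genes the fraction underflows to zero in double precision, which is the sense in which blind search in such a space "may take prohibitively long."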
Even more problematic are open-ended problems where behaviors and strategies are meant
to increase in sophistication indefinitely and there is no known final solution. For example, in
competitive games, it is not possible to estimate the complexity of the “best” possible player in
order to decide the size of a fixed-length genome. Similarly, many artificial life domains aim to
evolve increasingly complex artificial creatures for as long as possible (Maley 1999), which is
difficult with a fixed encoding for two reasons: (1) When a good strategy is found in a fixed-length
encoding, the entire representational space is used to encode it. Thus, the only way to improve it is
to alter the strategy, thereby sacrificing some of the functionality learned over previous generations.
(2) Fixing the size of the encoding in such domains arbitrarily limits how complex the evolved
controller can be, defeating the purpose of the experiment.
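The contrast between the two reasons above can be sketched in code. This is only an illustrative sketch under assumed names (`FixedGenome`, `VariableGenome`, `add_connection` are hypothetical, not the encoding discussed in the text): a fixed-length genome can only overwrite existing genes, while a variable-length genome can append new ones, preserving learned structure and letting complexity grow:

```python
import random

class FixedGenome:
    """Hypothetical fixed-length encoding: the gene count is chosen a
    priori and can never change."""
    def __init__(self, size):
        self.weights = [random.gauss(0.0, 1.0) for _ in range(size)]

    def mutate(self):
        # The only available change alters an existing gene, potentially
        # sacrificing functionality learned in earlier generations.
        i = random.randrange(len(self.weights))
        self.weights[i] += random.gauss(0.0, 0.1)

class VariableGenome:
    """Hypothetical variable-length encoding: connection genes can be
    appended, so the topology (and the dimensionality of the search
    space) can grow over the course of evolution."""
    def __init__(self):
        # Each gene is (source node, destination node, weight).
        self.genes = [(0, 2, random.gauss(0.0, 1.0)),
                      (1, 2, random.gauss(0.0, 1.0))]

    def add_connection(self, src, dst):
        # Adds a new dimension to the search space instead of
        # overwriting an existing one; prior structure is preserved.
        self.genes.append((src, dst, 0.0))

fixed = FixedGenome(4)
fixed.mutate()            # size stays 4 no matter how often this runs
var = VariableGenome()
var.add_connection(0, 3)  # encoding grows by one gene
```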
In order to discover solutions to difficult real-world problems and to open-ended problems,
a method is needed that can automatically estimate the right number of dimensions for the solution.
Even if that solution exists in high-dimensional space, search should spend the majority of time in
lower-dimensional space building up a foundation for the final solution. Such a method is developed