遗传算法驱动的深度卷积神经网络自适应设计

需积分: 9 104 浏览量更新于2024-08-05 收藏 693KB PDF 举报

遗传CNN（Genetic CNN）是一种前沿的深度学习架构搜索方法，它旨在解决深度卷积神经网络（Deep Convolutional Neural Networks, CNN）设计中的复杂性。传统的CNN在大规模视觉识别任务中表现出色，研究人员通过增加网络深度、构建高通路连接等策略来提升性能，但手动设计网络结构往往受限于人类专家的经验和创造力。随着网络层数的增长，潜在的网络结构组合呈指数级增长，这使得传统的手工设计变得难以应对庞大的搜索空间。为了克服这个挑战，遗传算法被引入到CNN结构的自动化设计中。遗传算法的核心思想是将每个网络结构编码为固定长度的二进制字符串，这样可以方便地进行遗传操作，如选择、突变和交叉，这些操作有助于寻找最有效的网络配置。遗传算法的流程如下： 1. 初始化阶段：随机生成一组个体，每个个体代表一个可能的网络结构。这些结构可能包含不同数量的卷积层、池化层、全连接层以及各种类型的激活函数和优化器设置。 2. 选择：在每一代中，根据适应度函数（通常衡量模型在特定数据集上的性能）选择出表现优秀的个体，以便将其遗传给下一代。 3. 突变：对选出的个体进行变异操作，比如改变某些层的参数，或者调整网络连接方式，以引入新的可能性。 4. 交叉：通过基因重组，将两个或更多个体的部分结构合并，生成新的混合结构，进一步探索搜索空间。 5. 重复迭代：经过多代的进化过程，算法逐渐收敛到具有较好性能的网络结构，这些结构可能超越了人类设计者的设计。遗传CNN的优势在于其能够处理大量的网络结构可能性，并且在没有先验知识的情况下，通过自然选择和适应性优化找到潜在的最优解决方案。这种方法不仅节省了人工设计的时间，还能发掘出可能被忽视的高效网络结构，对于推动深度学习领域特别是计算机视觉任务的性能提升具有重要意义。然而，遗传算法的效率和结果质量依赖于编码方式、适应度函数的选择以及算法参数的调优，这些都是未来研究的关键方向。

Genetic CNN

Lingxi Xie, Alan Yuille

Department of Computer Science, The Johns Hopkins University, Baltimore, MD, USA

198808xc@gmail.com alan.l.yuille@gmail.com

Abstract

The deep convolutional neural network (CNN) is the

state-of-the-art solution for large-scale visual recognition.

Following some basic principles such as increasing network

depth and constructing highway connections, researchers

have manually designed a lot of ﬁxed network architectures

and veriﬁed their effectiveness.

In this paper, we discuss the possibility of learning deep

network structures automatically. Note that the number

of possible network structures increases exponentially with

the number of layers in the network, which motivates us to

adopt the genetic algorithm to efﬁciently explore this large

search space. The core idea is to propose an encoding

method to represent each network structure in a ﬁxed-length

binary string. The genetic algorithm is initialized by gen-

erating a set of randomized individuals. In each genera-

tion, we deﬁne standard genetic operations, e.g., selection,

mutation and crossover, to generate competitive individuals

and eliminate weak ones. The competitiveness of each

individual is deﬁned as its recognition accuracy, which is

obtained via a standalone training process on a reference

dataset. We run the genetic process on CIFAR10, a small-

scale dataset, demonstrating its ability to ﬁnd high-quality

structures which are little studied before. The learned pow-

erful structures are also transferrable to the ILSVRC2012

dataset for large-scale visual recognition.

1. Introduction

Visual recognition is a fundamental task in computer

vision, implying a wide range of applications. Recently, the

state-of-the-art algorithms on visual recognition are mostly

based on the deep Convolutional Neural Network (CNN).

Starting from the fundamental chain-styled network model-

s [19], researchers have been increasing the depth of the

network [32], as well as designing novel network mod-

ules [36][13] to improve recognition accuracy. Although

these modern networks have been shown to be efﬁcient, we

note that their structures are manually designed, not learned,

which limits the ﬂexibility of the approach.

In this paper, we reveal the possibility of automatically

learning the structure of deep neural networks. We consider

a constrained case, in which the network has a limited

number of stages, and each stage is deﬁned as a set of pre-

deﬁned building blocks such as convolution and pooling

layers. Even under these limitations, the total number of

possible network structures grows exponentially with the

number of layers, making it impractical to enumerate all the

candidates and ﬁnd the best one. Instead, we formulate this

problem as optimization in a large search space, and apply

the genetic algorithm to exploring the space efﬁciently.

The genetic algorithm involves constructing an initial

population of individuals, and performing genetic opera-

tions to allow them to evolve in an iterative process. We

propose a novel encoding scheme to represent each network

structure as a ﬁxed-length binary string, and deﬁne several

standard genetic operations, i.e., selection, mutation and

crossover, so that new competitive individuals are generated

from the previous generation and weak ones are eliminated.

The quality (ﬁtness function) of each individual is deter-

mined by its recognition accuracy on a reference dataset.

To this end, we perform a complete training process for

each individual (i.e., network structure) which is inde-

pendent to the genetic algorithm. The genetic process

comes to an end after a ﬁxed number of generations.

It is worth emphasizing that the genetic algorithm is

computationally expensive, as we need to undergo a com-

plete network training process for each generated individ-

ual. We adopt the strategy to run the genetic process on a

small dataset ( CIFAR10), in which we observe the ability

of the genetic algorithm to ﬁnd effective network struc-

tures, and then transfer the learned top-ranked structures

to perform large-scale visual recognition. The learned

structures, most of which have been less studied before,

often perform better than the manually designed ones in

either small-scale or large-scale experiments.

The remainder of this paper is organized as follows.

Section 2 brieﬂy introduces related work. Section 3 il-

lustrates the way of using the genetic algorithm to design

network structures. Experiments are shown in Section 4,

and conclusions are drawn in Section 5.

1379

下载后可阅读完整内容，剩余9页未读，立即下载

maerdym

粉丝: 307
资源: 20

遗传算法驱动的深度卷积神经网络自适应设计

Genetic CNN

Genetic-CNN-master_CNN_CNN网络_遗传算法cnn_遗传算法_

CNN_Genetic_algorithm:使用GA查找最佳超参数

GA-CNN遗传算法程序.rar

cgp-cnn:设计CNN架构的遗传程序设计方法，在GECCO 2017中（口头演示，最佳论文奖）

遗传_adam_CNN_加入归一化附python代码.zip

Genetic Algorithm-遗传算法附matlab代码.zip

genetic_algorithm:基于适用于特定数据集的适应度函数，用于发展深层神经网络的最佳结构的遗传算法框架的实现

基于Matlab的温度预测遗传算法CNN-LSTM-Multihead-Attention模型实现

遗传算法优化CNN-LSTM-Attention网络实现风电功率预测

最新资源