
Exploring Randomly Wired Neural Networks for Image Recognition
Saining Xie Alexander Kirillov Ross Girshick Kaiming He
Facebook AI Research (FAIR)
Abstract
Neural networks for image recognition have evolved through extensive manual design from simple chain-like models to structures with multiple wiring paths. The success of ResNets [11] and DenseNets [16] is due in large part to their innovative wiring plans. Now, neural architecture search (NAS) studies are exploring the joint optimization of wiring and operation types; however, the space of possible wirings is constrained and still driven by manual design despite being searched. In this paper, we explore a more diverse set of connectivity patterns through the lens of randomly wired neural networks. To do this, we first define the concept of a stochastic network generator that encapsulates the entire network generation process. Encapsulation provides a unified view of NAS and randomly wired networks. Then, we use three classical random graph models to generate randomly wired graphs for networks. The results are surprising: several variants of these random generators yield network instances that have competitive accuracy on the ImageNet benchmark. These results suggest that new efforts focusing on designing better network generators may lead to new breakthroughs by exploring less constrained search spaces with more room for novel design.
1. Introduction
What we call deep learning today descends from the connectionist approach to cognitive science [38, 7], a paradigm reflecting the hypothesis that how computational networks are wired is crucial for building intelligent machines. Echoing this perspective, recent advances in computer vision have been driven by moving from models with chain-like wiring [19, 53, 42, 43] to more elaborate connectivity patterns, e.g., ResNet [11] and DenseNet [16], that are effective in large part because of how they are wired.
Figure 1. Randomly wired neural networks generated by the classical Watts-Strogatz (WS) [50] model: these three instances of random networks achieve (left-to-right) 79.1%, 79.1%, and 79.0% classification accuracy on ImageNet under a computational budget similar to that of ResNet-50, which achieves 77.1%.

Advancing this trend, neural architecture search (NAS) [55, 56] has emerged as a promising direction for jointly searching over wiring patterns and the operations to perform. NAS methods focus on search [55, 56, 33, 26, 29, 27] while implicitly relying on an important, yet largely overlooked, component that we call a network generator (defined in §3.1). The NAS network generator defines a family of possible wiring patterns from which networks are sampled subject to a learnable probability distribution. However, like the wiring patterns in ResNet and DenseNet, the NAS network generator is hand-designed, and the space of allowed wiring patterns is constrained to a small subset of all possible graphs. Given this perspective, we ask: What happens if we loosen this constraint and design novel network generators?
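To make the generator abstraction concrete ahead of its formal definition in §3.1, the following is a minimal Python sketch, not an interface from this paper: a generator maps its parameters to an architecture, and a stochastic generator additionally takes a random seed, so one fixed set of parameters can yield many different wirings. The names, fields, and parameter values below are illustrative assumptions.

# Minimal, illustrative sketch of a stochastic network generator (hypothetical
# names; not code from this paper). A generator maps parameters theta to an
# architecture; a stochastic generator also takes a seed, so the same theta
# yields many different wirings.
import random
from dataclasses import dataclass

@dataclass
class Architecture:
    nodes: list   # node ids
    edges: list   # (src, dst) pairs forming a DAG
    op: str       # operation applied at every node, e.g. a 3x3 separable conv

def stochastic_generator(theta: dict, seed: int) -> Architecture:
    """Sample one architecture from the family that theta defines."""
    rng = random.Random(seed)
    n, p = theta["num_nodes"], theta["edge_prob"]
    nodes = list(range(n))
    # Orient every sampled edge from lower to higher index so the graph is acyclic.
    edges = [(i, j) for i in nodes for j in nodes if i < j and rng.random() < p]
    return Architecture(nodes, edges, op=theta["op"])

# Hand-designed generator parameters; different seeds give different wirings.
theta = {"num_nodes": 8, "edge_prob": 0.4, "op": "sep_conv_3x3"}
net_a = stochastic_generator(theta, seed=0)
net_b = stochastic_generator(theta, seed=1)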
We explore this question through the lens of randomly wired neural networks that are sampled from stochastic network generators, in which a human-designed random process defines generation. To reduce bias from us (the authors of this paper) on the generators, we use three classical families of random graph models in graph theory [51]: the Erdős-Rényi (ER) [6], Barabási-Albert (BA) [1], and Watts-Strogatz (WS) [50] models. To define complete networks, we convert a random graph into a directed acyclic graph (DAG) and apply a simple mapping from nodes to their functional roles (e.g., to the same type of convolution).
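As a concrete illustration of this pipeline, the sketch below (illustrative code, not the released implementation) uses the networkx library to sample a graph from one of the three model families, orients each edge from the lower- to the higher-indexed node to obtain a DAG, and then visits the nodes in topological order, where a complete network would aggregate each node's inputs and apply the same convolution. All parameter values (n, p, k, m) are placeholders, not the settings studied in this paper.

import networkx as nx

def random_dag(model="WS", n=32, seed=0):
    """Sample an undirected random graph and orient it into a DAG."""
    if model == "ER":
        g = nx.erdos_renyi_graph(n, p=0.2, seed=seed)
    elif model == "BA":
        g = nx.barabasi_albert_graph(n, m=5, seed=seed)
    else:  # WS: ring lattice with k neighbors, each edge rewired with probability p
        g = nx.watts_strogatz_graph(n, k=4, p=0.25, seed=seed)
    dag = nx.DiGraph()
    dag.add_nodes_from(g.nodes())
    # One simple orientation: point every edge from the lower- to the
    # higher-indexed node, which guarantees the directed graph is acyclic.
    dag.add_edges_from((u, v) if u < v else (v, u) for u, v in g.edges())
    return dag

dag = random_dag("WS", n=32, seed=0)
for node in nx.topological_sort(dag):
    inputs = list(dag.predecessors(node))
    # A full network would sum the input tensors and apply the same type of
    # convolution at every node; here we only print the wiring.
    print(f"node {node:2d} <- {inputs}")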