entrepreneurs has been realized to a far greater degree in the past ten years, owing in part to the development of powerful, well-documented frameworks.
Realizing the potential of deep learning presents unique challenges because any single application
brings together various disciplines. Applying deep learning requires simultaneously understand-
ing (i) the motivations for casting a problem in a particular way; (ii) the mathematical form of a
given model; (iii) the optimization algorithms for fitting the models to data; (iv) the basic statistical principles and intuitions that help us to extract generalizable insights from data; and (v) the engineering required to train models efficiently, navigating the pitfalls of numerical computing and getting the most out of available hardware. Teaching the critical thinking skills required to formulate problems, the mathematics to solve them, and the software tools to implement those solutions all in one place presents formidable challenges. Our goal in this book is to present a unified resource to bring would-be practitioners up to speed.
When we started this book project, there were no resources that simultaneously (i) were up to
date; (ii) covered the full breadth of modern machine learning with substantial technical depth;
and (iii) interleaved exposition of the quality one expects from an engaging textbook with the
clean runnable code that one expects to find in hands-on tutorials. We found plenty of code exam-
ples for how to use a given deep learning framework (e.g., how to do basic numerical computing
with matrices in TensorFlow) or for implementing particular techniques (e.g., code snippets for
LeNet, AlexNet, ResNets, etc.) scattered across various blog posts and GitHub repositories. However, these examples typically focused on how to implement a given approach, but left out the
discussion of why certain algorithmic decisions are made. While some interactive resources have
popped up sporadically to address a particular topic, e.g., the engaging blog posts published on the website Distill¹, or personal blogs, they only covered selected topics in deep learning, and often
lacked associated code. On the other hand, while several deep learning textbooks have emerged—
e.g., (Goodfellow et al., 2016), which offers a comprehensive survey of the concepts behind deep
learning—these resources do not marry the descriptions to realizations of the concepts in code,
sometimes leaving readers clueless as to how to implement them. Moreover, too many resources
are hidden behind the paywalls of commercial course providers.
We set out to create a resource that could (i) be freely available for everyone; (ii) offer sufficient
technical depth to provide a starting point on the path to actually becoming an applied machine
learning scientist; (iii) include runnable code, showing readers how to solve problems in practice;
(iv) allow for rapid updates, both by us and by the community at large; and (v) be complemented by a forum² for interactive discussion of technical details and to answer questions.
These goals were often in conflict. Equations, theorems, and citations are best managed and laid
out in LaTeX. Code is best described in Python. And webpages are native in HTML and JavaScript.
Furthermore, we wanted the content to be accessible as executable code, as a physical book, as a downloadable PDF, and on the Internet as a website. At present there exist no tools and no workflow perfectly suited to these demands, so we had to assemble our own. We describe our approach
in detail in Section 19.6. We settled on GitHub to share the source and to facilitate community con-
tributions, Jupyter notebooks for mixing code, equations and text, Sphinx as a rendering engine
to generate multiple outputs, and Discourse for the forum. While our system is not yet perfect,
these choices provide a good compromise among the competing concerns. We believe that this
might be the first book published using such an integrated workflow.
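To make this toolchain concrete, the following is a minimal sketch of a Sphinx configuration that builds a site from Jupyter notebooks mixing code, equations, and text. It is not the configuration used for this book; it assumes the third-party nbsphinx extension and the standard Sphinx builders.

# conf.py: minimal sketch of a Sphinx project that renders Jupyter notebooks.
# Not the configuration used for this book; assumes `pip install sphinx nbsphinx`.

project = "example-book"
author = "Community contributors"

extensions = [
    "nbsphinx",            # executes and renders .ipynb sources
    "sphinx.ext.mathjax",  # renders LaTeX equations in the HTML output
]

# Skip build artifacts and notebook checkpoints when collecting sources.
exclude_patterns = ["_build", "**.ipynb_checkpoints"]

# Theme for the website build; the PDF comes from the separate LaTeX builder.
html_theme = "alabaster"

With such a setup, running sphinx-build -b html . _build/html would generate the website, while the LaTeX builder followed by a LaTeX run would produce a PDF; community contributions then arrive as ordinary pull requests against the notebook sources on GitHub.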
¹ http://distill.pub
² http://discuss.d2l.ai