深度学习入门：概念与数学基础

需积分: 10 164 浏览量更新于2024-07-18 收藏 20.68MB PDF 举报

《深度学习》(DeepLearningBook) 是由Ian Goodfellow、Yoshua Bengio和Aaron Courville合著的一本经典著作，专注于解决那些直观问题的深层学习解决方案。该书将计算机的学习能力与经验联系起来，通过层次化的概念理解世界，每个概念都以其与更简单概念的关系来定义。书中内容涵盖了深度学习的核心数学和机器学习基础知识。在第一章"Introduction"中，作者首先探讨了读者群体，明确指出这本书适合希望深入了解深度学习理论和技术的专业人士，以及对人工智能有兴趣但没有接触过复杂数学背景的初学者。章节中提到了历史趋势，强调了近年来深度学习在人工智能领域中的显著崛起，如卷积神经网络(CNN)和循环神经网络(RNN)的发展。第二部分深入剖析了"Applied Math and Machine Learning Basics"，其中详细介绍了线性代数的基础知识。从Scalars、Vectors、Matrices和Tensors的基本概念开始，作者讲解了矩阵和向量的乘法运算，以及如何理解和运用Identity和Inverse Matrices。线性依赖性和span的概念帮助读者理解变量之间的关系，而Norms（范数）则用于衡量向量的大小。此外，书还讨论了特殊类型的矩阵和向量，如Eigen decomposition（特征分解）和Singular Value Decomposition（奇异值分解），这两个工具在深度学习模型的训练和优化中扮演着关键角色。接着，Moore-Penrose Pseudoinverse（最小二乘法的逆）和Trace Operator（迹运算）等概念被用来处理线性系统的求解。Determinant（行列式）则在理解矩阵的性质和变换时至关重要。作者还通过一个实例——主成分分析(PCA)，展示了这些概念在实际问题中的应用。第三部分转向"Probability and Information Theory"，这是深度学习理论基石之一。作者解释了概率在决策和预测中的核心作用，并介绍Random Variables（随机变量）、Probability Distributions（概率分布）、Marginal Probability（边缘概率）和Conditional Probability（条件概率）。链式条件概率规则和Independence/Conditional Independence（独立性/条件独立性）的概念是建立复杂模型的基础。此外，Expectation（期望）、Variance（方差）和Covariance（协方差）这些统计量对于理解数据的不确定性至关重要，它们在深度学习模型的评估和正则化中不可或缺。《深度学习》这本书提供了一个扎实的数学和概率基础框架，为读者探索现代深度学习算法背后的理论原理和实践技巧奠定了坚实的基础。通过系统学习，读者不仅能掌握深度学习方法，还能理解如何将其应用于实际问题中，推动人工智能技术的进步。

Chapter

tro

duction

tors

long

dreamed

creating

mac

hines

that

think.

This

desire

dates

bac

least

the

time

ancien

Greece.

The

ythical

ﬁgures

Pygmalion,

Daedalus,

and

Hephaestus

all

interpreted

legendary

tors,

and

Galatea,

alos,

and

andora

may

all

regarded

artiﬁcial

life

(

Ovid

and

Martin

2004

Spark

1996

andy

1997

;

When

programmable

computers

ere

ﬁrst

conceiv

ed,

eople

ondered

whether

they

migh

ecome

intelligen

undred

ears

efore

one

was

built

(

elace

1842

oday

artiﬁcial

intel

ligenc

(AI)

thriving

ﬁeld

with

man

practical

applications

and

active

researc

topics.

intelligen

softw

are

automate

routine

lab

or, understand

eech

images, mak

diagnoses

medicine

and

supp

ort

basic

scientiﬁc

research.

the

early

artiﬁcial

telligence,

the

ﬁeld

rapidly

tackled

and

solved

problems

that

are

intellectually

diﬃcult

for

human

eings

but

relativ

ely

straight-

forw

ard

for

computers—problems

that

can

describ

list

formal,

math-

ematical

rules. The

true

challenge

artiﬁcial

intelligence

prov

solving

the

tasks

that

are

easy

for

eople

erform

but

hard

for

eople

describ

formally—problems

that

solve

intuitiv

ely

that

feel

automatic,

recognizing

ords

faces

images.

This

out

solution

these

intuitiv

problems.

This

solution

allow

computers

learn

from

exp

erience

and

understand

the

world

terms

hierarc

concepts,

with

each

concept

deﬁned

terms

its

relation

simpler

concepts.

gathering

knowledge

from

experience,

this

approac

oids

the

need

for

uman

erators

formally

ecify

all

the

knowledge

that

the

computer

needs.

The

hierarc

concepts

allows

the

computer

learn

complicated

concepts

building

them

out

simpler

ones.

draw

graph

showing

how

these

CHAPTER

INTR

ODUCTION

concepts

are

built

top

eac

other,

the

graph

deep,

with

man

lay

ers.

this

reason,

call

this

approach

arning

Man

the

early

successes

took

place

relativ

ely

sterile

and

formal

vironmen

and

did

not

require

computers

kno

wledge

out

the

orld. F

example,

IBM’s

Deep

Blue

chess-pla

ying

system

defeated

orld

hampion

Garry

Kasparo

1997

(

Chess

course

very

simple

Hsu

2002

orld,

con

taining

only

sixt

y-four

cations

and

thirt

y-t

pieces

that

can

mov

only

rigidly

circumscrib

ys.

Devising

successful

chess

strategy

is a

tremendous

accomplishmen

t, but

the

challenge

not

due

the

diﬃculty

describing

the

set

chess

pieces

and

allo

able

mov

the

computer.

Chess

can

completely

describ

very

brief

list

completely

formal

rules,

easily

pro

vided

ahead

time

the

programmer.

Ironically

abstract

and

formal

tasks

that

are

among

the

most

diﬃcult

mental

undertakings

for

uman

eing

are

among

the

easiest

for

computer.

Computers

long

een

able

defeat

even

the

est

human

chess

play

er,

but

are

only

recen

tly

matching

some

the

abilities

erage

human

eings

recognize

jects

eech.

erson’s

everyda

life

requires

immense

amount

kno

wledge

out

the

world.

Much

this

kno

wledge

sub

jectiv

and

intuitiv

and

therefore

diﬃcult

articulate

formal

Computers

need

capture

this

same

kno

wledge

order

eha

telligen

One

the

key

challenges

artiﬁcial

telligence

how

get

this

informal

kno

wledge

into

computer.

Sev

eral

artiﬁcial

telligence

pro

jects

hav

sought

hard-co

knowledge

out

the

worl

formal

languages.

computer

can

reason

out

statements

these

formal

languages

automatically

using

logical

inference

rules.

This

kno

the

know

dge

ase

approac

artiﬁcial

intelligence.

None

these

pro

jects

has

led

jor

success.

One

the

most

famous

such

pro

jects

Cyc

(

Lenat

and

Guha

1989

Cyc

inference

engine

and

database

statements

language

called

CycL.

These

statements

are

tered

staﬀ

human

sup

ervisors.

wieldy

pro

cess.

People

struggle

devise

formal

rules

with

enough

complexity

accurately

describ

the

world.

example,

Cyc

failed

understand

story

out

erson

named

red

shaving

the

morning

(

Its

inference

Linde

1992

engine

detected

inconsistency

the

story: it

knew

that

eople

not

electrical

parts,

but

ecause

red

holding

electric

razor,

elieved

the

tit

“F

redWhileShaving”

contained

electrical

parts.

therefore

ask

whether

red

was

still

erson

while

was

sha

ving.

The

diﬃculties

faced

systems

relying

hard-coded

kno

wledge

suggest

that

systems

need

the

ability

acquire

their

own

kno

wledge,

extracting

patterns

from

data.

This

capabilit

known

machine

arning

The

tro

duction

CHAPTER

INTR

ODUCTION

mac

hine

learning

allo

computers

tackle

problems

inv

olving

knowledge

the

real

orld

and

mak

decisions

that

app

ear

sub

jective.

simple

machine

learning

algorithm

called

gistic

ession

can

determine

whether

recommend

cesarean

deliv

ery

(

Mor-Y

osef

1990

al.

simple

machine

learning

algorithm

called

can

separate

legitimate

e-mail

from

spam

e-mail.

naive

Bayes

The

erformance

these

simple

machine

learning

algorithms

dep

ends

heavily

the

epr

esentation

the

data

they

are

given.

example,

when

logistic

regression

used

recommend

cesarean

deliv

ery

the

system

not

examine

the

patient

directly

Instead,

the

ctor

tells

the

system

several

pieces

relev

information,

suc

the

presence

absence

uterine

scar.

Each

piece

information

included

the

represen

tation

the

patient

known

atur

Logistic

regression

learns

eac

these

features

the

patient

correlates

with

arious

outcomes.

er,

cannot

inﬂuence

the

that

the

features

are

deﬁned

any

. If

logistic

regression

was

given

MRI

scan

the

patient,

rather

than

the

ctor’s

formalized

rep

ort,

would

not

able

mak

useful

predictions.

Individual

pixels

MRI

scan

negligible

correlation

with

complications

that

might

ccur

during

delivery

This

dep

endence

represen

tations

general

phenomenon

that

app

ears

throughout

computer

science

and

even

daily

life.

computer

science,

opera-

tions

suc

searching

collection

data

can

pro

ceed

exp

onentially

faster

the

collection

structured

and

indexed

intelligen

tly

. P

eople

can

easily

erform

arithmetic

Arabic

numerals,

but

ﬁnd

arithmetic

Roman

umerals

time-consuming.

not

surprising

that

the

choice

represen

tation

has

enormous

eﬀect

the

erformance

mac

hine

learning

algorithms.

simple

visual

example,

see

Fig.

1.1

Man

artiﬁcial

intelligence

tasks

can

solv

designing

the

righ

set

features

extract

for

that

task,

then

providing

these

features

simple

machine

learning

algorithm.

example,

useful

feature

for

eak

iden

tiﬁcation

from

sound

estimate

the

size

speaker’s

vocal

tract.

therefore

giv

strong

clue

whether

the

eaker

man,

oman,

hild.

er,

for

man

tasks,

diﬃcult

know

what

features

should

extracted.

example,

supp

ose

that

would

lik

write

program

detect

cars

photographs.

know

that

cars

wheels,

might

use

the

presence

wheel

feature. Unfortunately

diﬃcult

describ

exactly

what

wheel

oks

terms

pixel

alues.

wheel

has

simple

geometric

shap

but

its

image

may

complicated

shadows

falling

the

wheel,

the

sun

glaring

oﬀ

the

metal

parts

the

wheel,

the

fender

the

car

ject

the

foreground

obscuring

part

the

wheel,

and

on.

CHAPTER

INTR

ODUCTION

that

are

directly

observed.

Instead,

they

may

exist

either

unobserv

jects

unobserved

forces

the

ysical

world

that

aﬀect

observ

able

quan

tities.

They

also

exist

constructs

the

uman

mind

that

pro

vide

useful

simplifying

explanations

inferred

causes

the

observ

data.

They

can

thought

concepts

abstractions

that

help

make

sense

the

rich

ariabilit

the

data.

When

analyzing

eech

recording,

the

factors

ariation

include

the

eak

er’s

age,

their

sex,

their

accent

and

the

words

that

they

are

eaking.

When

analyzing

image

car,

the

factors

ariation

include

the

osition

the

car,

its

color,

and

the

angle

and

brightness

the

sun.

jor

source

diﬃcult

many

real-w

orld

artiﬁcial

intelligence

applications

that

many

the

factors

ariation

inﬂuence

ery

single

piece

data

are

able

observe.

The

individual

pixels

image

red

car

migh

ery

black

night.

The

shap

the

car’s

silhouette

dep

ends

the

viewing

angle.

Most

applications

require

the

factors

ariation

and

discard

the

disentangle

ones

that

not

care

about.

course,

can

very

diﬃcult

extract

such

high-level,

abstract

features

from

data.

Man

these

factors

ariation,

such

eak

er’s

accen

can

iden

tiﬁed

only

using

sophisticated,

nearly

human-lev

understanding

the

data.

When

nearly

diﬃcult

obtain

representation

solve

the

original

problem,

representation

learning

not,

ﬁrst

glance,

seem

help

us.

arning

solv

this

central

problem

represen

tation

learning

introduc-

ing

represen

tations

that

are

expressed

terms

other,

simpler

representations.

Deep

learning

allows

the

computer

build

complex

concepts

out

simpler

con-

cepts.

Fig.

shows

how

deep

learning

system

can

represen

the

concept

1.2

image

erson

combining

simpler

concepts,

such

corners

and

contours,

whic

are

turn

deﬁned

terms

edges.

The

quin

tessen

tial

example

deep

learning

del

the

feedforw

ard

deep

net

ork

multilayer

eptr

(MLP).

ultila

erceptron

just

mathe-

matical

function

mapping

some

set

input

alues

output

alues.

The

function

formed

comp

osing

many

simpler

functions.

can

think

each

application

diﬀerent

mathematical

function

pro

viding

new

representation

the

input.

The

idea

learning

the

right

represen

tation

for

the

data

provides

one

ersp

ec-

tiv

deep

learning.

Another

ersp

ective

deep

learning

that

depth

allows

the

computer

learn

ulti-step

computer

program.

Eac

lay

the

represen

tation

can

thought

the

state

the

computer’s

memory

after

executing

another

set

instructions

parallel.

Net

orks

with

greater

depth

can

execute

instructions

sequence.

Sequential

instructions

oﬀer

great

ecause

later

instructions

can

refer

back

the

results

earlier

instructions.

ccording

this

剩余801页未读，继续阅读

captainbnu

粉丝: 0
资源: 2

深度学习入门：概念与数学基础

DeepLearningBook-ReadingNotes, DeepLearningBook 读书会笔记及讲义.zip

DeepLearningBook高清英文最新版PDF

DeepLearningBook.pdf

Deeplearningbook Chinese-version

DeepLearningBook高清中文最新版

deeplearningbook_machinelearning_

deeplearningbook 中英文-含目录

带目录，超高清deeplearningbook-chinese中文本

deeplearningbook-chinese, 深入学习中文中文翻译.zip

基于deeplearningbook-chinese的中文深度学习书籍设计源码

最新资源