深度强化学习入门

需积分: 23 80 浏览量更新于2024-07-09 收藏 2.46MB PDF 举报

"An Introduction to Deep Reinforcement Learning" 是一本由Vincent François-Lavet等人编写的书籍，于2018年在《机器学习基础与趋势》上发表。该书深入探讨了深度强化学习这一领域，是理解这个交叉学科的重要资源。深度强化学习是机器学习的一个分支，它结合了深度学习的复杂模型处理能力和强化学习的决策制定策略。书中首先介绍了机器学习和深度学习的基础，接着深入到强化学习的世界。在机器学习部分，作者详细讨论了监督学习，包括偏差和过拟合的概念，这是理解和优化学习算法的关键。此外，还简述了无监督学习，这种学习方式允许模型从未标记的数据中自我学习模式。深度学习方法部分，作者阐述了如何利用神经网络来构建复杂模型，解决传统机器学习算法难以处理的高维度和非线性问题。这些网络能够学习并表示大量的输入数据，并在多个层次上进行抽象。进入强化学习章节，书中定义了正式的学习框架，这是一个马尔可夫决策过程（MDP），强调了状态、动作、奖励和动态等关键元素。接着，书中介绍了学习策略的不同组件，如价值函数和策略函数，这些都是解决环境互动问题的核心工具。此外，书中还探讨了从数据中学习策略的不同设置，包括在线学习和离线学习，以及它们各自的挑战和适用场景。特别是，对于深度强化学习，即使用深度神经网络作为函数近似器来估计值函数或策略，是解决大型、高维度状态空间问题的关键。价值基于的方法在深度强化学习中占据重要地位，这种方法主要依赖于预测未来奖励的总和（即价值）来指导决策。Q学习和深度Q网络（DQN）是这一领域的里程碑式工作，它们通过经验回放缓冲区和目标网络等技术解决了稳定学习的问题。这本书为读者提供了一个全面的框架，用于理解和实践深度强化学习，涵盖了从基本概念到最新技术的广泛内容，是学习这个领域的宝贵参考资料。

12 Machine learning and deep learning

in the last few years is to have an ever-growing number of layers, with

more than 100 in some supervised learning tasks (Szegedy et al., 2017).

We merely describe here two types of layers that are of particular interest

in deep RL (and in many other tasks).

Convolutional layers (LeCun, Bengio, et al., 1995) are particularly

well suited for images and sequential data (see Fig 2.3), mainly due

to their translation invariance property. The layer’s parameters consist

of a set of learnable ﬁlters (or kernels), which have a small receptive

ﬁeld and which apply a convolution operation to the input, passing

the result to the next layer. As a result, the network learns ﬁlters that

activate when it detects some speciﬁc features. In image classiﬁcation,

the ﬁrst layers learn how to detect edges, textures and patterns; then

the following layers are able to detect parts of objects and whole objects

(Erhan et al., 2009; Olah et al., 2017). In fact, a convolutional layer is

a particular kind of feedforward layer, with the speciﬁcity that many

weights are set to 0 (not learnable) and that other weights are shared.

input image

or input feature map

ﬁlter

0 1

1 0

output feature maps

Figure 2.3:

Illustration of a convolutional layer with one input feature map that is

convolved by diﬀerent ﬁlters to yield the output feature maps. The parameters that

are learned for this type of layer are those of the ﬁlters. For illustration purposes,

some results are displayed for one of the output feature maps with a given ﬁlter (in

practice, that operation is followed by a non-linear activation function).

剩余139页未读，继续阅读

qq_43122429

粉丝: 0
资源: 1

深度强化学习入门

An Introduction to Deep Reinforcement Learning

Reinforcement Learning: An Introduction最新版习题解答（第一版本）

增强学习导论（Reinforcement Learning An Introduction_Sutton）

帮助学习强化学习的书籍

适合人工智能深度学习的参考文献

关于深度学习的外文文献

reinforcement learning : an introduction

tensorflow 强化学习 书

给出自主机器学习的参考文献

dqn算法书籍推荐系统

最新资源

tensorflow 强化学习书