深度学习基础到实践:卷1

需积分: 10 17 下载量 11 浏览量 更新于2024-07-18 2 收藏 130.28MB PDF 举报
"《Deep Learning Vol. 1 From Basics to Practice》是由Andrew Glassner编写的深度学习基础到实践的教程,涵盖了从机器学习基础知识到深度学习核心算法的多个章节。" 该书详细介绍了深度学习的基础知识,从机器学习的入门概念开始,逐渐深入到统计学、概率论以及贝叶斯规则等数学基础。这些章节对于理解和构建深度学习模型至关重要,因为它们提供了对数据建模和预测的基本理解。 统计学章节涉及了数据分析的基础,包括描述性统计、假设检验和回归分析,这些都是理解和解释数据的关键工具。概率论章节则讲解了随机事件的概率、条件概率以及联合概率,这些都是构建概率模型和理解不确定性所必需的。而贝叶斯规则是概率论中的一个核心概念,用于更新先验知识并计算后验概率,对于推理和决策过程具有重要意义。 接着,书中探讨了曲线与曲面,这是理解多维数据和神经网络中权重空间的重要概念。信息理论章节则讨论了熵、互信息和条件熵,这些概念在优化模型和压缩信息时起着重要作用。 在机器学习部分,书中详细介绍了分类问题,讨论了监督学习中的训练与测试过程,以及过拟合和欠拟合的概念。这些内容对于避免模型在新数据上的泛化能力降低至关重要。随后,书中还涉及了神经元模型,以及学习和推理的过程。 数据预处理是机器学习中的重要步骤,第12章对此进行了讲解,包括特征选择、标准化和归一化等技术。接着,第13章介绍了各种分类器,如逻辑回归、支持向量机等,并在第14章讨论了集成方法,如随机森林和梯度提升,以提高模型的准确性和稳定性。 第15章专门介绍了Scikit-Learn,这是一个广泛使用的Python机器学习库,包含多种预训练的分类、回归和聚类算法。后续章节则转向深度学习的核心内容,包括前馈神经网络、激活函数(如Sigmoid、ReLU)以及反向传播算法,这些都是构建和训练深度学习模型的基础。最后,书中还讨论了优化器,如梯度下降的不同变种,用于改善模型训练的效率和性能。 《Deep Learning Vol. 1 From Basics to Practice》是一部全面介绍深度学习的教材,从基础知识出发,逐步引导读者进入深度学习的世界,为实践中的深度学习项目提供了坚实的基础。通过阅读本书,读者可以系统地掌握深度学习所需的理论知识和实践技巧。
2019-01-06 上传
Deep Learning, Vol. 1: From Basics to Practice By 作者: Andrew Glassner Pub Date: 2018 ISBN: n/a Pages: (909 of 1750) Language: English Format: PDF People are using the tools of deep learning to change how we think about science, art, engineering, business, medicine, and even music. This book is for people who want to understand this field well enough to create deep learning systems, train them, and then use them with confidence to make their own contributions. The book takes a friendly, informal approach. Our goal is to make the ideas of this field simple and accessible to everyone, as shown in the Contents below. Since most practitioners today use one of several free, open-source deep-learning libraries to build their systems, the hard part isn’t in the programming. Rather, it’s knowing what tools to use, and when, and how. Building a working deep learning system requires making a series of technically informed choices, and with today’s tools, those choices require understanding what’s going on under the hood. This book is designed to give you that understanding. You’ll be able to choose the right kind of architecture, how to build a system that can learn, how to train it, and then how to use it to accomplish your goals. You’ll be able to read and understand the documentation for whatever library you’d like to use. And you’ll be able to follow exciting, on-going breakthroughs as they appear, because you’ll have the knowledge and vocabulary that let you read new material, and discuss it with other people doing deep learning. The book is extensively illustrated with over 1000 original figures. They are also all available for free download, for your own use. You don’t need any previous experience with machine learning or deep learning for this book. You don’t need to be a mathematician, because there’s nothing in the book harder than the occasional multiplication. You don’t need to choose a particular programming language, or library, or piece of hardware, because our approach is largely independent of those things. Our focus is on the principles and techniques that are applicable to any language, library, and hardware. Even so, practical programming is important. To stay focused, we gather our programming discussions into 3 chapters that show how to use two important and free Python libraries. Both chapters come with extensive Jupyter notebooks that contain all the code. Other chapters also offer notebooks for for every Python-generated figure. Our goal is to give you all the basics you need to understand deep learning, and then show how to use those ideas to construct your own systems. Everything is covered from the ground up, culminating in working systems illustrated with running code. The book is organized into two volumes. Volume 1 covers the basic ideas that support the field, and which form the core understanding for using these methods well. Volume 2 puts these principles into practice. Deep learning is fast becoming part of the intellectual toolkit used by scientists, artists, executives, doctors, musicians, and anyone else who wants to discover the information hiding in their data, paintings, business reports, test results, musical scores, and more.