深度学习入门经典:数学与基础工具详解

需积分: 10 31 下载量 156 浏览量 更新于2024-07-17 收藏 23.94MB PDF 举报
"《深度学习Cookbook》是一本专为想要深入了解和应用深度学习技术的专业人士设计的权威指南,它不仅涵盖了深度学习的基础理论,还提供了实用的工具和实例,适合初学者和经验丰富的从业者查阅。本书由深度学习领域的知名专家Ian Goodfellow、Yoshua Bengio和Aaron Courville合著,英文原版出版,非扫描版确保了高质量的内容。 正文开始: 第一部分,"Introduction",旨在引导读者了解深度学习的历史趋势以及它在当今信息技术中的重要性。书中详细解释了谁应该阅读这本书:无论是初入深度学习领域的学生,还是希望提升现有技能的工程师,都能从中找到适合的学习路径。作者会回顾自上世纪50年代以来深度学习的发展,强调深度神经网络如何从简单的模型发展到如今能够解决复杂问题的强大工具。 第二部分,"Applied Math and Machine Learning Basics",深入浅出地介绍了线性代数等数学基础知识,这些都是深度学习的基石。章节包括了:向量和矩阵的基本概念,如标量、矢量、矩阵和张量的操作;矩阵乘法、单位矩阵与逆矩阵的计算;线性依赖与向量空间的定义,以及向量的范数和特殊类型的矩阵。此外,还有特征值分解(Eigen decomposition)、奇异值分解(SVD)以及 Moore-Penrose 倒数等高级概念,这些都是实现神经网络训练和优化的核心技巧。 第三部分,"Probability and Information Theory",探讨了概率论和信息论在深度学习中的应用。这部分强调了概率在理解随机性和不确定性上的核心作用。读者将学习随机变量、概率分布及其性质,如边际概率、条件概率、独立性和条件独立性,以及期望、方差和协方差等统计概念。这些知识对于处理模型的不确定性和决策制定至关重要。 通过每章精心编排的练习和实际案例,读者可以逐步掌握深度学习的基本原理和实践技巧。《深度学习Cookbook》不仅是理论教材,也是深度学习工程师的实用参考手册,是进入这个领域不可或缺的参考资料。无论是理论探索者还是实践经验者,这本书都能提供一个扎实且全面的深度学习学习路径。"
2018-07-25 上传
While the boom in computational power and better techniques led to an increase in interest in neural networks, we have also seen huge strides in usability. In particular, deep learning frameworks like TensorFlow, Theano, and Torch allow nonexperts to construct complex neural networks to solve their own machine learning problems. This has turned a task that used to require months or years of handcoding and head-on-table-banging effort (writing efficient GPU kernels is hard!) into something that anyone can do in an afternoon (or really a few days in practice). Increased usability has greatly increased the number of researchers who can work on deep learning problems. Frameworks like Keras with an even higher level of abstraction make it possible for anyone with a working knowledge of Python and some tools to run some interesting experiments, as this book will show. A second important factor for “why now” is that large datasets have become available for everybody. Yes, Facebook and Google might still have the upper hand with access to billions of pictures, user comments, and what have you, but datasets with millions of items can be had from a variety of sources. In Chapter 1 we’ll look at a variety of options, and throughout the book the example code for each chapter will usually show in the first recipe how to get the needed training data. At the same time, private companies have started to produce and collect orders of magnitude more data, which has made the whole area of deep learning suddenly commercially very interesting. A model that can tell the difference between a cat and a dog is all very well, but a model that increases sales by 15% by taking all historic sales data into account can be the difference between life and death for a company.