Machine Learning in Action: PDF of the Peter Harrington Edition

10.32 MB PDF · updated 2024-07-25

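"Machine Learning in Action(2012.3)] Peter Harrington. 文字版.pdf"

Machine Learning in Action is a hands-on guide to machine learning written by Peter Harrington and published by Manning in 2012. The book aims to help readers understand and master the core concepts and techniques of machine learning through practice. Its content ranges from basic theory to practical application, and it suits both beginners interested in machine learning and practitioners with some experience. The book may cover the following key topics:

1. **Introduction to machine learning**: the basic concepts of machine learning, such as supervised, unsupervised, semi-supervised, and reinforcement learning, and how each applies in different scenarios.
2. **Data preprocessing**: how to clean, transform, and normalize data, an important step before building a machine learning model, including missing-value handling, outlier detection, and feature scaling.
3. **Algorithm implementations**: a range of classic machine learning algorithms, such as linear regression, logistic regression, decision trees, random forests, support vector machines (SVM), naive Bayes, k-nearest neighbors (KNN), and clustering algorithms (such as K-means), with Python implementations.
4. **Model evaluation and selection**: how to measure model performance with metrics such as accuracy, recall, F1 score, and the AUC-ROC curve, along with parameter-tuning techniques such as cross-validation and grid search.
5. **Introduction to deep learning**: a book from 2012 is unlikely to cover deep learning in depth, but it may briefly introduce the basics of neural networks and backpropagation as a foundation for further study.
6. **Hands-on projects**: practical cases, such as text classification, recommender systems, and image recognition, that show how to apply the material to real-world problems.
7. **Programming language**: the book most likely uses Python as its main programming language, since Python was then, and still is, a very popular language for machine learning, with a rich set of libraries such as scikit-learn, numpy, and pandas.
8. **Mathematical foundations**: the necessary mathematical concepts, such as probability theory, statistics, matrix operations, and optimization theory, so that readers can understand the principles behind the algorithms.
9. **Software tools and libraries**: how to use Python's scientific computing libraries (such as NumPy and SciPy), data processing libraries (such as Pandas), and machine learning libraries (such as scikit-learn).
10. **Continued learning and resources**: possibly some recommended resources for further study and recent research, to help readers keep up with this fast-moving field.

Machine Learning in Action is a practice-oriented machine learning tutorial that ties theory closely to practice. It aims to help readers learn machine learning from scratch and become able to solve practical problems on their own.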

CONTENTS

15.6 Example: the Pegasos algorithm for distributed SVMs

The Pegasos algorithm ■ Training: MapReduce support vector machines with mrjob

15.7 Do you really need MapReduce?

15.8 Summary

appendix A Getting started with Python

appendix B Linear algebra

appendix C Probability refresher

appendix D Resources

index

PREFACE


data was not assumed to be uniformly spaced in time, and they covered more algorithms but with less rigor. I later realized that similar methods were also being taught in the economics, electrical engineering, and computer science departments.

In early 2009, I graduated and moved to Silicon Valley to start work as a software consultant. Over the next two years, I worked with eight companies on a very wide range of technologies and saw two trends emerge which make up the major thesis for this book: first, in order to develop a compelling application you need to do more than just connect data sources; and second, employers want people who understand theory and can also program.

A large portion of a programmer’s job can be compared to the concept of connecting pipes—except that instead of pipes, programmers connect the flow of data—and monstrous fortunes have been made doing exactly that. Let me give you an example. You could make an application that sells things online—the big picture for this would be allowing people a way to post things and to view what others have posted. To do this you could create a web form that allows users to enter data about what they are selling and then this data would be shipped off to a data store. In order for other users to see what a user is selling, you would have to ship the data out of the data store and display it appropriately. I’m sure people will continue to make money this way; however to make the application really good you need to add a level of intelligence. This intelligence could do things like automatically remove inappropriate postings, detect fraudulent transactions, direct users to things they might like, and forecast site traffic. To accomplish these objectives, you would need to apply machine learning. The end user would not know that there is magic going on behind the scenes; to them your application “just works,” which is the hallmark of a well-built product.

An organization may choose to hire a group of theoretical people, or “thinkers,” and a set of practical people, “doers.” The thinkers may have spent a lot of time in academia, and their day-to-day job may be pulling ideas from papers and modeling them with very high-level tools or mathematics. The doers interface with the real world by writing the code and dealing with the imperfections of a non-ideal world, such as machines that break down or noisy data. Separating thinkers from doers is a bad idea and successful organizations realize this. (One of the tenets of lean manufacturing is for the thinkers to get their hands dirty with actual doing.) When there is a limited amount of money to be spent on hiring, who will get hired more readily—the thinker or the doer? Probably the doer, but in reality employers want both. Things need to get built, but when applications call for more demanding algorithms it is useful to have someone who can read papers, pull out the idea, implement it in real code, and iterate.

I didn’t see a book that addressed the problem of bridging the gap between thinkers and doers in the context of machine learning algorithms. The goal of this book is to fill that void, and, along the way, to introduce uses of machine learning algorithms so that the reader can build better applications.
