没有合适的资源?快使用搜索试试~ 我知道了~
首页Data Algorithms Recipes for Scaling Up with Hadoop and Spark 无水印pdf 0分
Data Algorithms Recipes for Scaling Up with Hadoop and Spark 无水印...

Data Algorithms Recipes for Scaling Up with Hadoop and Spark 英文无水印pdf pdf使用FoxitReader和PDF-XChangeViewer测试可以打开
资源详情
资源评论
资源推荐

Mahmoud Parsian
Data
Algorithms
RECIPES FOR SCALING UP WITH HADOOP AND SPARK

DATAMATH
Data Algorithms
ISBN: 978-1-491-90618-7
US $69.99 CAN $80.99
Twitter: @oreillymedia
facebook.com/oreilly
If you are ready to dive into the MapReduce framework for processing
large datasets, this practical book takes you step by step through
the algorithms and tools you need to build distributed MapReduce
applications with Apache Hadoop or Apache Spark. Each chapter provides
a recipe for solving a massive computational problem, such as building a
recommendation system. You’ll learn how to implement the appropriate
MapReduce solution with code that you can use in your projects.
Dr. Mahmoud Parsian covers basic design patterns, optimization techniques,
and data mining and machine learning solutions for problems in bioinformatics,
genomics, statistics, and social network analysis. This book also includes an
overview of MapReduce, Hadoop, and Spark.
Topics include:
■ Market basket analysis for a large set of transactions
■ Data mining algorithms (K-means, KNN, and Naive Bayes)
■ Using huge genomic data to sequence DNA and RNA
■ Naive Bayes theorem and Markov chains for data and market
prediction
■ Recommendation algorithms and pairwise document similarity
■ Linear regression, Cox regression, and Pearson correlation
■ Allelic frequency and mining DNA
■ Social network analysis (recommendation systems, counting
triangles, sentiment analysis)
Mahmoud Parsian, PhD in Computer Science, is a practicing software professional with
30 years of experience as a developer, designer, architect, and author. Currently the leader
of Illumina’s Big Data team, he’s spent the past 15 years working with Java (server-side),
databases, MapReduce, and distributed computing. Mahmoud is the author of JDBC
Recipes and JDBC Metadata, MySQL, and Oracle Recipes (both Apress).
Mahmoud Parsian
Data
Algorithms
RECIPES FOR SCALING UP WITH HADOOP AND SPARK
Data
Algorithms
Parsian

Mahmoud Parsian
Boston
Data Algorithms

978-1-491-90618-7
[LSI]
Data Algorithms
by Mahmoud Parsian
Copyright © 2015 Mahmoud Parsian. All rights reserved.
Printed in the United States of America.
Published by O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472.
O’Reilly books may be purchased for educational, business, or sales promotional use. Online editions are
also available for most titles (http://safaribooksonline.com). For more information, contact our corporate/
institutional sales department: 800-998-9938 or corporate@oreilly.com.
Editors: Ann Spencer and Marie Beaugureau
Production Editor: Matthew Hacker
Copyeditor: Rachel Monaghan
Proofreader: Rachel Head
Indexer: Judith McConville
Interior Designer: David Futato
Cover Designer: Ellie Volckhausen
Illustrator: Rebecca Demarest
July 2015: First Edition
Revision History for the First Edition
2015-07-10: First Release
See http://oreilly.com/catalog/errata.csp?isbn=9781491906187 for release details.
While the publisher and the author have used good faith efforts to ensure that the information and
instructions contained in this work are accurate, the publisher and the author disclaim all responsibility
for errors or omissions, including without limitation responsibility for damages resulting from the use of
or reliance on this work. Use of the information and instructions contained in this work is at your own
risk. If any code samples or other technology this work contains or describes is subject to open source
licenses or the intellectual property rights of others, it is your responsibility to ensure that your use
thereof complies with such licenses and/or rights.

is book is dedicated to my dear family:
wife, Behnaz,
daughter, Maral,
son, Yaseen
剩余777页未读,继续阅读















安全验证
文档复制为VIP权益,开通VIP直接复制

评论1