首页Apache Spark 2.0.0 for Beginners
Apache Spark 2.0.0 for Beginners
需积分: 10 55 浏览量 更新于2023-05-25 评论 收藏 10.9MB PDF 举报
纯英文版Apache Spark 2 for Beginners 本书介绍了介绍了基于rdd的Spark编程、基于数据集的Spark编程、基于Spark sql的数据爆炸来处理结构化数据、基于Spark流的侦听器程序来不断侦听传入的消息并对其进行处理，以及基于Spark graphx的应用程序来处理follower关系。文章最后增加应用程序用例。
Apache Spark 2 for Beginners
Copyright © 2016 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or
transmitted in any form or by any means, without the prior written permission of the
publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the
information presented. However, the information contained in this book is sold without
warranty, either express or implied. Neither the author, nor Packt Publishing, and its
dealers and distributors will be held liable for any damages caused or alleged to be caused
directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the
companies and products mentioned in this book by the appropriate use of capitals.
However, Packt Publishing cannot guarantee the accuracy of this information.
First published: September 2016
Production reference: 1260916
Published by Packt Publishing Ltd.
35 Livery Street
B3 2PB, UK.
Content Development Editor
About the Author
Rajanarayanan Thottuvaikkatumana, Raj, is a seasoned technologist with more than 23
years of software development experience at various multinational companies. He has lived
and worked in India, Singapore, and the USA, and is presently based out of the UK. His
experience includes architecting, designing, and developing software applications. He has
worked on various technologies including major databases, application development
platforms, web technologies, and big data technologies. Since 2000, he has been working
mainly in Java related technologies, and does heavy-duty server-side programming in Java
and Scala. He has worked on very highly concurrent, highly distributed, and high
transaction volume systems. Currently he is building a next generation Hadoop YARN-
based data processing platform and an application suite built with Spark using Scala.
Raj holds one master's degree in Mathematics, one master's degree in Computer
Information Systems and has many certifications in ITIL and cloud computing to his credit.
Raj is the author of Cassandra Design Patterns - Second Edition, published by Packt.
When not working on the assignments his day job demands, Raj is an avid listener to
classical music and watches a lot of tennis.
deprecate email@example.com › firstname.lastname@example.org › email@example.com › @npmcli/move-file@^2.0.0 This functionality has been moved to @npmcli/fs
com.netflix.client.ClientException: Load balancer does not have available server for client: erp降低eureka版本为多少能解决
- 我的内容管理 收起
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额