掌握Big Data SMACK:Apache Spark、Mesos等技术指南

需积分: 10 14 下载量 118 浏览量 更新于2024-07-19 收藏 11.09MB PDF 举报
"《Big Data SMACK:Apache Spark、Mesos、Akka、Cassandra和Kafka指南》一书由Raul Estrada和Isaac Ruiz合著,于2016年出版,针对大数据领域中的热门技术栈SMACK(Spark++)进行了深入讲解。SMACK这个名字来源于这些组件的集合,它们在当前和未来的大数据处理中占据了主导地位。 书中强调,在2014、2015和2016年的开发者薪资调查中,数据工程师、数据科学家和数据架构师的收入水平较高,反映出大数据技术在IT行业的巨大需求。传统上,处理大量数据的工作主要由拥有博士学位的顶尖大学出身的专业人士负责,然而随着技术的发展,这种格局正在改变。Apache Spark以其开源特性颠覆了这一行业,因为它打破了大型企业对数据处理平台的垄断。由于开源社区的广泛参与,Spark等工具相较于专有软件更具优势,因为开源项目能够吸引众多开发者的贡献,从而实现更强大的功能。 Spark的特点之一是易于安装和部署,它能够在个人笔记本上轻松搭建,这对于开发者来说是个福音,特别是对于初创公司和小型企业,无需投入大量的生产环境或大型实验室即可进行开发。这种灵活性使得更多人有机会接触到大数据开发,并从中受益。 本书的目的在于帮助读者掌握SMACK技术栈,理解其如何在未来成为主流。通过学习Spark(分布式计算框架)、Mesos(资源管理系统)、Akka(高性能并行编程框架)、Cassandra(分布式数据库系统)以及Kafka(实时流处理平台),读者不仅能提升自己的技能,还能适应行业趋势,提高职业竞争力。对于希望成为高薪IT专业人士或者已经在该领域并且寻求未来发展趋势的人来说,这本书是一份宝贵的资源。 版权信息表明,所有内容受版权保护,未经许可不得复制、翻译、重印或以任何形式传播。书中可能包含商标名称、标志和图像,使用时需遵循相关商标使用规定。《Big Data SMACK》是一本实用且权威的大数据技术指南,适合数据从业者和爱好者深入学习和实践。"
2016-10-26 上传
Big Data SMACK: A Guide to Apache Spark, Mesos, Akka, Cassandra, and Kafka by Raul Estrada, Isaac Ruiz English | ISBN: 1484221745 | 2016 | EPUB | 264 pages | 2.35 MB This book is about how to integrate full-stack open source big data architecture and how to choose the correct technology―Scala/Spark, Mesos, Akka, Cassandra, and Kafka―in every layer. Big data architecture is becoming a requirement for many different enterprises. So far, however, the focus has largely been on collecting, aggregating, and crunching large datasets in a timely manner. In many cases now, organizations need more than one paradigm to perform efficient analyses. Big Data SMACK explains each of the full-stack technologies and, more importantly, how to best integrate them. It provides detailed coverage of the practical benefits of these technologies and incorporates real-world examples in every situation. The book focuses on the problems and scenarios solved by the architecture, as well as the solutions provided by every technology. It covers the six main concepts of big data architecture and how integrate, replace, and reinforce every layer: The language: Scala The engine: Spark (SQL, MLib, Streaming, GraphX) The container: Mesos, Docker The view: Akka The storage: Cassandra The message broker: Kafka What you’ll learn How to make big data architecture without using complex Greek letter architectures. How to build a cheap but effective cluster infrastructure. How to make queries, reports, and graphs that business demands. How to manage and exploit unstructured and No-SQL data sources. How use tools to monitor the performance of your architecture. How to integrate all technologies and decide which replace and which reinforce. Who This Book Is For This book is for developers, data architects, and data scientists looking for how to integrate the most successful big data open stack architecture and how to choose the correct technology in every layer.