没有合适的资源?快使用搜索试试~ 我知道了~
首页my presentation on RDD & Spark.pptx
资源详情
资源评论
资源推荐

Resilient Distributed Datasets (RDD)
A Fault-Tolerant Abstraction for In-Memory Cluster Computing
Paper author: Matei Zaharia et al
Speaker:
My Research:

1. Current Status of Apache Spark
2. Resilient Distributed Datasets, RDD
(paper)
1. Motivation
2. Challenges
3. RDD model
4. Implementation & Evaluation
3. Apache Spark Ecosystem
Outline

1. Current Status of Apache Spark
2. Resilient Distributed Datasets, RDD
(paper)
1. Motivation
2. Challenges
3. RDD model
4. Implementation & Evaluation
3. Apache Spark Ecosystem
Outline

A Brief History:
200
2
200
2
MapReduce @
Google
2004
MapReduce
paper
2006
Hadoop @
Yahoo!
200
4
200
6
200
8
201
0
201
2
201
4
201
4
Apache Spark top-
level
2010
Spark
paper
2008
Hadoop
Summit
201
6
202
0
Spark2.0 Spark3.0

Apache Spark’s Open Source Community
剩余59页未读,继续阅读















安全验证
文档复制为VIP权益,开通VIP直接复制

评论0