MISSION 60

Introduction to Spark

Learn the basics of Spark by analyzing guests on The Daily Show.

Objectives

  • Learn a brief history of a Big Data.
  • Learn about RDD objects and how they work in Spark.
  • Learn the basics of counting in Spark

Mission Outline

1. A Brief History of Big Data
2. The Spark Revolution
3. Resilient Distributed Data Sets (RDDs)
4. SparkContext
5. Lazy Evaluation
6. Pipelines
7. Python and Scala, Friends Forever
8. ReduceByKey()
9. Explanation
10. Filter
11. Practice with Pipelines
12. Next Steps
13. Takeaways

spark-map-reduce

Course Info:

Spark and Map-Reduce

Intermediate

The median completion time for this course is 6 hours.

This course requires a premium subscription and includes five missions, and one installation tutorial.  It is the 31st course in the Data Scientist In Python path.

START LEARNING FREE

Take a Look Inside