Introduction to Spark

Learn the basics of Spark by analyzing guests on The Daily Show.


  • Learn a brief history of a Big Data.
  • Learn about RDD objects and how they work in Spark.
  • Learn the basics of counting in Spark

Mission Outline

1. A Brief History of Big Data
2. The Spark Revolution
3. Resilient Distributed Data Sets (RDDs)
4. SparkContext
5. Lazy Evaluation
6. Pipelines
7. Python and Scala, Friends Forever
8. ReduceByKey()
9. Explanation
10. Filter
11. Practice with Pipelines
12. Next Steps
13. Takeaways

Course Info:

Spark and Map-Reduce


The average completion time for this course is 10-hours.

This course requires a premium subscription and includes four paid missions, one challenge, and one installation tutorial.  It is the 31st course in the Data Scientist In Python path.


Take a Look Inside

Share On Facebook
Share On Twitter
Share On Linkedin
Share On Reddit