MISSION 62

Challenge: Transforming Hamlet into a Data Set

Practice using Spark to transform the text of Hamlet into a usable data set.

Objectives

  • Transforming data from text files into RDD objects.
  • Cleaning data using lambda functions.

Mission Outline

1. Introduction
2. Extract Line Numbers
3. Remove Blank Values
4. Remove Pipe Characters

spark-map-reduce

Course Info:

Spark and Map-Reduce

Intermediate

The average completion time for this course is 10-hours.

This course requires a premium subscription and includes four paid missions, one challenge, and one installation tutorial.  It is the 31st course in the Data Scientist In Python path.

START LEARNING FREE

Take a Look Inside