MISSION 267

Guided Project: Hacker News Pipeline

Learn how to use a data pipeline to summarize Hacker News data.

Objectives

  • Learn to work with JSON API data in Python.
  • Learn to build a real-world data pipeline from raw data to summarization.

Mission Outline

1. Introduction to the Data
2. Loading the JSON Data
3. Filtering the Stories
4. Convert to CSV
5. Extract Title Column
6. Clean the Titles
7. Create the Word Frequency Dictionary
8. Sort the Top Words
9. Next Steps
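
The steps in the outline can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the project's actual solution: the sample records, the field names (title, points, num_comments), the min_points threshold, and every function name are assumptions made up for the example.

```python
import csv
import io
import json
import string
from collections import Counter

# Hypothetical sample records; the real dataset's fields may differ.
RAW_JSON = json.dumps({
    "stories": [
        {"title": "Show HN: A tiny data pipeline", "points": 120, "num_comments": 14},
        {"title": "Ask HN: How do you test a pipeline?", "points": 75, "num_comments": 30},
        {"title": "Data, data everywhere", "points": 10, "num_comments": 0},
    ]
})


def load_stories(raw):
    """Step 2: parse the JSON text into a list of story dicts."""
    return json.loads(raw)["stories"]


def filter_stories(stories, min_points=50):
    """Step 3: keep stories above an (assumed) points threshold."""
    return [s for s in stories if s["points"] > min_points]


def to_csv(stories):
    """Step 4: write the stories to an in-memory CSV file."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["title", "points", "num_comments"])
    for s in stories:
        writer.writerow([s["title"], s["points"], s["num_comments"]])
    buffer.seek(0)  # rewind so the next step can read from the start
    return buffer


def extract_titles(csv_file):
    """Step 5: pull the title column out of the CSV."""
    return [row["title"] for row in csv.DictReader(csv_file)]


def clean_titles(titles):
    """Step 6: lowercase each title and strip ASCII punctuation."""
    table = str.maketrans("", "", string.punctuation)
    return [t.lower().translate(table) for t in titles]


def word_frequencies(titles):
    """Step 7: count how often each word appears across all titles."""
    counts = Counter()
    for t in titles:
        counts.update(t.split())
    return counts


def top_words(counts, n=3):
    """Step 8: sort by descending count, breaking ties alphabetically."""
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:n]


def run_pipeline(raw=RAW_JSON, min_points=50, n=3):
    """Chain the steps so each one consumes the previous step's output."""
    stories = filter_stories(load_stories(raw), min_points)
    titles = clean_titles(extract_titles(to_csv(stories)))
    return top_words(word_frequencies(titles), n)


if __name__ == "__main__":
    print(run_pipeline())
```

Each function takes the previous step's output as its input, which is what lets the steps be chained into a single pipeline. Real titles would likely also need common stop words such as "a" and "the" removed before counting.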


Course Info:

Building a Data Pipeline

Intermediate

The median completion time for this course is 6.3 hours.

This course requires a premium subscription. It has four missions and one guided project, and it is the seventh course in the Data Engineer Path.
