Guided Project: Hacker News Pipeline

Learn how to use a data pipeline to summarize Hacker News data.


  • Learn to work with JSON API data in Python.
  • Learn to build a real-world data pipeline from raw data to summarization.

Mission Outline

1. Introduction to the Data
2. Loading the JSON Data
3. Filtering the Stories
4. Convert to CSV
5. Extract Title Column
6. Clean the Titles
7. Create the Word Frequency Dictionary
8. Sort the Top Words
9. Next Steps
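
The steps in the outline can be sketched roughly as below. This is a minimal illustration using only the Python standard library, not the course's actual solution: the sample records, the field names (title, points, num_comments), the "stories" key, the 50-point threshold, and every function name are assumptions made for the example.

```python
import csv
import io
import json
import string
from collections import Counter

# Hypothetical sample data; the real Hacker News dataset's layout may differ.
SAMPLE_JSON = json.dumps({
    "stories": [
        {"title": "Show HN: A Python data pipeline", "points": 120, "num_comments": 30},
        {"title": "Ask HN: Best Python books?", "points": 80, "num_comments": 55},
        {"title": "A low-scoring post", "points": 3, "num_comments": 0},
    ]
})


def load_stories(raw):
    """Parse the raw JSON text and return the list of story dicts."""
    return json.loads(raw)["stories"]


def filter_stories(stories, min_points=50):
    """Keep only stories above an (assumed) popularity threshold."""
    return [s for s in stories if s["points"] > min_points]


def to_csv(stories):
    """Write the stories to an in-memory CSV file and rewind it for reading."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["title", "points", "num_comments"])
    for s in stories:
        writer.writerow([s["title"], s["points"], s["num_comments"]])
    buffer.seek(0)
    return buffer


def extract_titles(csv_file):
    """Read the CSV back and return only the title column."""
    return [row["title"] for row in csv.DictReader(csv_file)]


def clean_titles(titles):
    """Lowercase each title and strip ASCII punctuation."""
    table = str.maketrans("", "", string.punctuation)
    return [t.lower().translate(table) for t in titles]


def word_frequencies(titles):
    """Count how often each whitespace-separated word appears."""
    counts = Counter()
    for t in titles:
        counts.update(t.split())
    return counts


def top_words(counts, n=2):
    """Return the n most frequent words as (word, count) pairs."""
    return counts.most_common(n)


def run_pipeline(raw, n=2):
    """Chain the steps: load, filter, convert, extract, clean, count, sort."""
    stories = filter_stories(load_stories(raw))
    titles = clean_titles(extract_titles(to_csv(stories)))
    return top_words(word_frequencies(titles), n)


if __name__ == "__main__":
    print(run_pipeline(SAMPLE_JSON))
```

Each function takes the previous step's output as its input, which keeps the stages easy to test and swap out individually.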

Course Info:

Building a Data Pipeline


The average completion time for this course is 10 hours.

This course requires a premium subscription. It has one free mission, three paid missions, and one guided project, and it is the seventh course in the Data Engineer Path.



