“Getting a data science job would have been much harder without Dataquest. It’s a great product. I still recommend it to anyone who asks me about how to get started.”

Data Science Manager @Later

Course overview

In your data science career, you’ll rarely get a dataset that is in precisely the state you want. That’s why data cleaning is such an invaluable skill in data science. This course builds on our previous Advanced Data Cleaning course and will make you a valuable asset to any data science team.

After learning how to prepare the data for analysis, the real fun begins — you’ll complete two data analysis and visualization guided projects using data from some of the biggest names in film culture.

Key skills

• Using the “two-phase” process to complete end-to-end data cleaning projects
• Combining, manipulating, exploring, and analyzing multiple datasets
• Completing compelling data cleaning guided projects

Course outline

Data Cleaning Walkthrough 2h

Lesson Objectives
• Research and prepare multiple datasets
• Clean data across multiple datasets

Data Cleaning Walkthrough: Combining the Data 2h

Lesson Objectives
• Combine multiple datasets
• Perform joins in pandas

Data Cleaning Walkthrough: Analyzing and Visualizing the Data 1h

Lesson Objectives
• Compute correlations in pandas
• Map schools using basemap

Guided Project: Analyzing NYC High School Data 1h

Lesson Objectives
• Generate scatter plots to compare columns

Challenge: Cleaning Data 1h

Lesson Objectives
• Clean data in pandas
• Apply functions over columns in pandas

Guided Project: Star Wars Survey 2h

Lesson Objectives
• Clean and map column values in pandas
• Compute summary statistics

Projects in this course

Analyzing NYC High School Data

For this project, you’ll assume the role of a data scientist analyzing relationships between SAT scores and demographic factors in NYC public schools to determine if the SAT is a fair test.
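To give a feel for the kind of analysis involved, here is a minimal sketch of computing correlations in pandas. The data is invented for illustration, and the column names (`sat_score`, `demographic_pct`, `total_enrollment`) are hypothetical stand-ins, not the project's actual schema.

```python
import pandas as pd

# Invented toy data; column names are hypothetical, not the project's real schema.
schools = pd.DataFrame({
    "sat_score": [1100, 1250, 1400, 1550],
    "demographic_pct": [80.0, 60.0, 40.0, 20.0],
    "total_enrollment": [400, 1200, 800, 1000],
})

# Pearson correlation of every other numeric column with sat_score.
correlations = schools.corr()["sat_score"].drop("sat_score")
print(correlations)
```

A correlation close to -1 or 1 signals a strong linear relationship worth a closer look with a scatter plot. It does not show on its own that one factor causes the other.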

Star Wars Survey

For this project, you’ll become a data analyst exploring FiveThirtyEight’s Star Wars survey data. You’ll use Python and pandas to map values, compute statistics, and analyze the data to uncover fan film preferences.
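As a rough illustration of mapping values and computing statistics, the sketch below converts Yes/No answers to Booleans and then averages them. The rows and column names (`seen_any_film`, `ranking_film_1`) are made up for this example and are not taken from the real survey file.

```python
import pandas as pd

# Invented survey rows; column names are hypothetical placeholders.
survey = pd.DataFrame({
    "seen_any_film": ["Yes", "No", "Yes", "Yes"],
    "ranking_film_1": [1, None, 3, 2],
})

# Map string answers to Booleans so they can be aggregated numerically.
yes_no = {"Yes": True, "No": False}
survey["seen_any_film"] = survey["seen_any_film"].map(yes_no)

# Share of respondents who have seen a film (True counts as 1).
seen_share = survey["seen_any_film"].mean()

# Mean ranking; pandas skips missing values by default.
mean_ranking = survey["ranking_film_1"].mean()

print(seen_share, mean_ranking)
```

Mapping to Booleans first keeps the later statistics simple, since the mean of a Boolean column is the fraction of True values.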

The Dataquest guarantee

Dataquest has helped thousands of people start new careers in data. If you put in the work and follow our path, you’ll master data skills and grow your career.

We believe so strongly in our paths that we offer a full satisfaction guarantee. If you complete a career path on Dataquest and aren’t satisfied with your outcome, we’ll give you a refund.

Master skills faster with Dataquest

Learn exactly what you need to achieve your goal. Don’t waste time on unrelated lessons.

Build confidence with our in-depth projects, and show off your data skills.

Challenge yourself with exercises

Work with real data from day one with interactive lessons and hands-on exercises.

Share the evidence of your hard work with your network and potential employers.

98% of learners recommend Dataquest
4.85 Dataquest rating, SwitchUp Best Bootcamps
$30k average salary boost for learners who complete a path

Aaron Melton

“Dataquest starts at the most basic level, so a beginner can understand the concepts. I tried learning to code before, using Codecademy and Coursera. I struggled because I had no background in coding, and I was spending a lot of time Googling. Dataquest helped me actually learn.”

Jessica Ko

“I liked the interactive environment on Dataquest. The material was clear and well organized. I spent more time practicing than watching videos, and it made me want to keep learning.”

Victoria E. Guzik

Associate Data Scientist at Callisto Media

“I really love learning on Dataquest. I looked into a couple of other options and I found that they were much too handhold-y and fill-in-the-blank relative to Dataquest’s method. The projects on Dataquest were key to getting my job. I doubled my income!”
