PATH

Data Scientist

Learn how to make inferences and predictions from data.

This path covers everything you need to learn to work as a data scientist using Python.

You'll learn the Python fundamentals, dig into data analysis and data viz, query databases with SQL, study statistics, and dig into building machine learning models all over the course of this carefully designed course path.

It's designed so that there are no prerequisites and no prior experience required. Everything you need to learn, you'll learn on this path!

As you learn, you'll apply each concept immediately by writing code right in your browser that's automatically checked by our system to give you near-instant feedback on your progress. 

We think the best way to learn is to learn by doing, so you'll be challenged every step of the way to really apply the concepts you're learning, and you'll build a variety of projects using real-world data to solve real data science problems.

By the end of this path, you'll have the skills you need to work as a data scientist, and you'll be comfortable with things like:

  • How to program in popular data science languages.
  • How to clean and visualize data.
  • How to make predictions using statistics and machine learning.
  • Collaboration tools like git and SQL databases.

START LEARNING

60+ FREE MISSIONS

By creating an account you agree to accept our terms of use and privacy policy.

Become a Data Scientist

Python for Data Science: Fundamentals

Learn the basics of Python programming and data science.

Python for Data Science: Intermediate

Learn the basics of Python programming and data science.

Pandas & NumPy Fundamentals

Learn how to analyze data using the pandas and NumPy libraries.

FREE + BASIC

Exploratory Data Visualization

Learn how to explore data by creating and interpreting data graphics. This course is taught using matplotlib and pandas.

Storytelling Through Data Visualization

Learn how to communicate insights and tell stories using data visualization.

Data Cleaning and Analysis

Learn how to clean and combine datasets, then practice your skills.

Data Cleaning in Python: Advanced

Learn advanced techniques for cleaning data in Python.

Data Cleaning Project Walkthrough

Learn how to clean and combine datasets, then practice your skills.

Elements of the Command Line

Learn the basics of the Bash to establish a foundation of working the command line as a springboard to using the command line for data science

Text Processing in the Command Line

Learn more about the command line and how to use it in your data science workflow.

SQL Fundamentals

Learn the basics of working with SQL databases.

SQL Intermediate: Table Relations & Joins

Learn to work with multi-table databases.

SQL & Databases: Advanced

Learn how to work with PostgreSQL, customize databases using indexing and how to improve database performance.

APIs & Web Scraping

Learn how to acquire data from APIs and the web.

Statistics: Fundamentals

Learn about sampling, variables and distributions.

Statistics Intermediate: Averages & Variability

Learn to summarize distributions, measure variability using variance or standard deviation, and compare values using z-scores.

Probability Fundamentals

Learn the fundamentals of probability theory using Python

Hypothesis Testing: Fundamentals

Learn more advanced statistical concepts including A/B tests and chi-squared tests for more powerful data analysis.

Machine Learning Fundamentals

Learn the fundamentals of machine learning using k-nearest neighbors.

Calculus for Machine Learning

Learn the calculus necessary for intermediate machine learning techniques like linear regression.

Linear Algebra for Machine Learning

Learn the linear algebra necessary for intermediate machine learning techniques like linear regression.

Linear Regression for Machine Learning

Learn how to use the linear regression machine learning model.

Machine Learning in Python: Intermediate

Dive more into Machine learning.

Decision Trees

Learn how to construct and interpret decision trees.

Deep Learning: Fundamentals

Learn the basics of deep neural networks. Includes graph representation, activation functions, multiple hidden layers, and image classification.

Machine Learning Project

Learn what a complete data science project looks like, from data cleaning to machine learning.

Kaggle Fundamentals

Learn how to get started with and participate in Kaggle competitions with Kaggle's 'Titanic' competition.

Exploring Topics in Data Science

Explore other topics in data science, like NLP and clustering.

Natural Language Processing

Learn how to analyze and make predictions on textual data.

Data Structures & Algorithms

Learn how computers work and how they work with data.

Python Programming Advanced

Learn advanced concepts in Python, including more on object-oriented programming, lambda functions, and exception handling.

Command Line: Intermediate

Learn more about the command line and how to use it in your data analysis workflow.

Git & Version Control

Learn the basics of Python programming and data science.

Spark & Map-Reduce

Learn how to use Apache Spark and the map-reduce technique to clean and analyze large datasets.