Data Science Projects

Discover a wide variety of guided projects that let you work with real data in real-world scenarios while learning and applying new data science skills. Our projects are designed by experienced data scientists and reflect the challenges faced in the field. They guide you through the process, challenge your skills, and offer flexibility to add your own twist.

You can complete each project right in your browser, or you can download the data set to your computer and work locally! If you work on our site, you’ll also be able to download your code at any time so that you can continue locally, or upload your project to GitHub.

Intermediate Python for Web Development

Guided Project: Garden Simulator Text-Based Game

Create a gardening game. From foraging seeds to nurturing plants and selling the produce, every acti...
Python Functions and Jupyter Notebook

Project: Learn and Install Jupyter Notebook

Learn the basics of Jupyter Notebook
Python Dictionaries, APIs, and Functions for Web Development

Guided Project: Build a Food Ordering App

Create a functional and interactive food ordering application using Python. You’ll use diction...
Introduction to Python Programming for Web Development

Guided Project: Word Raider

Create a functional and interactive word-guessing game using Python. You’ll use a variables, l...
Natural Language Processing for Deep Learning

Guided Project: Classifying Disaster-Related Tweets as Real or Fake

Build a deep learning text classification model to predict whether a given tweet is about a real dis...
Convolutional Neural Networks for Deep Learning

Guided Project: Detect Pneumonia Using X-Ray Images with CNNs and Transfer Learning

Build and train multiple deep learning models to detect pneumonia in images of chest X-rays. 
Sequence Models for Deep Learning

Guided Project: Time-Series Forecasting on the S&P 500

Build, train, and evaluate an LSTM model with a convolutional layer for S&P 500 index (stock) pr...
Introduction to Python Programming

Guided Project: Kaggle Data Science Survey

For this project, we’ll act as a data analyst for Kaggle. Kaggle surveyed a large population of da...
Introduction to Deep Learning in TensorFlow

Guided Project: Predicting Listing Gains in the Indian IPO Market Using TensorFlow

Build a deep learning model to predict the listing gains of IPOs in the Indian market.
Sharing Insights in Tableau

Guided Project: Data Presentation

A guided project to finish the data presentation module.
Data Visualization Fundamentals in Tableau

Guided Project: Business Intelligence Plots

Practice creating data visualizations in this guided project.
Data Preparation in Tableau

Guided Project — Data Prep in Tableau

Guided Project — Data Prep in Tableau
Analyzing Data in Excel

Guided Project: Analyzing Retail Sales

Work with retail sales data to explore trends and relationships. Build basic models to confirm the s...
Exploring Data in Excel

Guided Project: Identifying Customers Likely to Churn for a Telecommunications Provider

In this project, we’ll conduct an Exploratory Data Analysis (EDA) on data from a telecommunica...
Visualizing Data in Excel

Guided Project: Visualizing the Answer to Stock Questions Using Spreadsheet Charts

In this project, we will apply the knowledge and skills we developed in the preceding lessons to pre...
Preparing Data in Excel

Guided Project: Preparing Data with Excel

In this guided project, you’ll import a dataset, organize it, clean it, and then consolidate it in...
Manage Workspaces and Datasets in Power BI

Guided Project: Building a BI App

Build a business intelligence app in Power BI that visualizes course quality and completion data for...
Visualize data in Power BI

Guided Project: Visualization of Life Expectancy and GDP Variation Over Time

Apply what you’ve learned to a real-world database.
Model Data in Power BI

Guided Project: Modeling Data in Power BI

Apply data cleaning and data model building to real-world scenarios
SQL Subqueries

Guided Project: Customers and Products Analysis Using SQL

Work on a real-life project using SQL
Introduction to Interactive Web Applications in Shiny

Guided Project: Creating a Project Portfolio

Create a Shiny app to showcase your personal data projects
Introduction to Machine Learning in R

Guided Project: Predicting Car Prices

Practice the machine learning workflow using k-nearest neighbors to predict car prices
Linear Regression Modeling in R

Guided Project - Predicting Condominium Sale Prices

Combine the skills you learned in this course to perform linear regression modeling of condominium s...
Hypothesis Testing in R

Guided Project: Winning Jeopardy

Learn how to analyze text while figuring out strategies to win at Jeopardy.
Conditional Probability in R

Guided Project: Building a Spam Filter with Naive Bayes

Learn to use conditional probability and Naive Bayes in a practical setting.
Introduction to Probability in R

Guided Project: Mobile App for Lottery Addiction

Learn to use probability and combinatorics in a practical setting.
Intermediate Statistics in R

Guided Project: Finding the Best Markets to Advertise In

Learn to combine the skills you learned in this course to perform practical data analysis.
Introduction to Statistics in R

Guided Project: Investigating Fandango Movie Ratings

Learn to combine the skills you learned in this course to perform practical data analysis.
Introduction to Web Scraping in R

Guided Project: Analyzing Movie Ratings

Learn to combine the skills you learned in this course to perform practical web data extraction.
Intermediate SQL in R

Guided Project: Answering Business Questions using SQL

Practice the SQL data analysis workflow.
Introduction to APIs in R

Guided Project: New York Solar Resource Data

Learn to combine the skills you learned in this course to perform practical data extraction.
Introduction to Data Cleaning in R

Guided Project: NYC Schools Perceptions

Practice your data cleaning and analysis skills and learn to use R Notebooks as you explore survey d...
Introduction to Data Visualization in R

Guided Project: Analyzing Forest Fire Data

Use data visualization techniques to explore data on forest fires.
Specialized Data Processing in R

Guided Project: Creating An Efficient Data Analysis Workflow, Part 2

Employ even more programming techniques to create a reusable data workflow.
Control Flow, Iteration, and Functions in R

Guided Project: Creating An Efficient Data Analysis Workflow

Apply control flow, loops and functions to create a reusable data workflow.
Data Structures in R

Guided Project: Investigating COVID-19 Virus Trends

Learn to combine the skills you learned in this course to perform practical data analysis.
Introduction to Data Analysis in R

Project: Install RStudio

Learn how to install and use RStudio, a free and open-source development environment for R.
Building a Data Pipeline

Guided Project: Hacker News Pipeline

Learn how to use a data pipeline to summarize hacker news data.
Recursion and Trees for Data Engineering

Guided Project: Implementing a Key-Value Database

Use a B-Tree to implement a key-value datastore in Python.
Introduction to Data Structures

Guided Project: Analyzing Stock Prices

Use the correct data structures to analyze stock prices efficiently
Parallel Processing for Data Engineering

Guided Project: Analyzing Wikipedia Pages

Use MapReduce to analyze Wikipedia pages more quickly.
Processing Large Datasets In Pandas

Guided Project: Practice Optimizing DataFrames and Processing in Chunks

Practice optimizing DataFrame types and working in chunks.
PostgresSQL for Data Engineering

Project: PostgreSQL Installation

Learn how to install PostgreSQL and the Psycopg2 library.
Introduction to Algorithms

Guided Project: Building Fast Queries on a CSV

Apply what you have learned to implement an inventory system for a laptop store with efficient queri...
Intermediate Python for Data Engineering

Guided Project: Exploring Hacker News Posts

Practice using loops, cleaning strings, and working with dates in python.
Analyzing Large Datasets in Spark and Map-Reduce

Project: Spark Installation and Jupyter Notebook Integration

Learn how to set up PySpark and integrate it with Jupyter Notebook.
Optimizing Machine Learning Models in Python

Guided Project: Optimizing Model Prediction

In this guided project, we will predict the damage done by forest fires and improve upon an existing...
Decision Tree and Random Forest Modeling in Python

Guided Project: Predicting Employee Productivity Using Tree Models

In this project, using the “Productivity Prediction of Garment Employees” dataset from t...
Logistic Regression Modeling in Python

Guided Project: Classifying Heart Disease

In this guided project, you will practice the machine learning workflow and practice creating and op...
Gradient Descent Modeling in Python

Guided Project: Stochastic Gradient Descent on Linear Regression

In this project, you will load, explore, and prepare a dataset to build a stochastic gradient descen...
Linear Regression Modeling in Python

Guided Project: Predicting Insurance Costs

In this guided project, practice linear regression modeling and evaluation.
Introduction to Unsupervised Machine Learning in Python

Guided Project: Credit Card Customer Segmentation

For this project, we’ll build a clustering model to segment credit card customers into different g...
Introduction to Supervised Machine Learning in Python

Guided Project: Predicting Heart Disease

Build a K Nearest Neighbors classifier to predict whether patients might be at risk of heart disease...
Introduction to Git and Version Control

Project: Git Installation and GitHub Integration

Learn how to install Git and authenticate with GitHub.
Data Analysis for Business in Python

Guided Project: Popular Data Science Questions

Apply the skills you’ve learned to determine the creation of data science content.
SQL Fundamentals

Guided Project: Analyzing CIA Factbook Data Using SQL

Practice using SQL to analyze CIA Factbook data.
Data Cleaning Project Walkthrough

Guided Project: Analyzing NYC High School Data

Perform additional analysis on a clean dataset with maps and visualizations.
Data Cleaning and Analysis in Python

Guided Project: Clean and Analyze Employee Exit Surveys

Practice data cleaning using pandas.
Telling Stories Using Data Visualization and Information Design

Guided Project: Storytelling Data Visualization on Exchange Rates

Combine the skills you’ve learned to create a storytelling data visualization.
Introduction to Data Visualization in Python

Guided Project: Finding Heavy Traffic Indicators on I-94

Learn to analyze data using exploratory data visualization.
Introduction to Pandas and NumPy for Data Analysis

Guided Project: Exploring eBay Car Sales Data

Practice data cleaning and data exploration using pandas
For Loops and Conditional Statements in Python

Guided Project: Prison Break

Learn the basics of Jupyter Notebook by analyzing a dataset on helicopter prison escapes