Data Science Projects

If you’re learning data science, you’re probably on the lookout for cool data science projects. Look no further! We have a wide variety of guided projects that’ll get you working with real data in real-world scenarios while also helping you learn and apply new data science skills.

The projects in the list below are also designed to help you get a job! Each project was designed by a data scientist on our content team, and they’re representative examples of the real projects working data analysts and data scientists do every day. They’re designed to guide you through the process while also challenging your skills, and they’re open-ended so that you can put your own twist on each project and use it for your data science portfolio.

You can complete each project right in your browser, or you can download the data set to your computer and work locally! If you work on our site, you’ll also be able to download your code at any time so that you can continue locally, or upload your project to GitHub.

The sky is the limit here and what you decide to look into further is completely up to you and your imagination!

Guided Project: Analyzing Kickstarter Projects

<p>For this guided project, you”ll take on the role of a data analyst
at a startup. The product team is considering launching a campaign on Kickstarter
to test the viability of some offerings. You”ve been asked to pull data that will
help the team understand what might influence the success of a campaign.&nbsp;</p>

Guided Project: Kaggle Data Science Survey

For this project, we’ll act as a data analyst for Kaggle. Kaggle surveyed a large population of data scientists and asked them about their current career status and the various skills they use. Our goal is to analyze this dataset and learn what skills are necessary for becoming a solid data scientist.

Guided Project: Predicting Listing Gains in the Indian IPO Market Using TensorFlow

Build a deep learning model to predict the listing gains of IPOs in the Indian market.

Guided Project: Predicting Bike Rentals

Apply decision trees and random forests to predict the number of bike rentals.

Guided Project: Data Presentation

A guided project to finish the data presentation module.

Guided Project: Business Intelligence Plots

Practice creating data visualizations in this guided project.

Guided Project — Data Prep in Tableau

Guided Project — Data Prep in Tableau

Guided Project: Analyzing Retail Sales

Work with retail sales data to explore trends and relationships. Build basic models to confirm the statistical significance of your insights.

Guided Project: Identifying Customers Likely to Churn for a Telecommunications Provider

In this project, we’ll conduct an Exploratory Data Analysis (EDA) on data from a telecommunications provider to create profiles on customers that are at risk of churn.

Guided Project: Visualizing the Answer to Stock Questions Using Spreadsheet Charts

In this project, we will apply the knowledge and skills we developed in the preceding lessons to present stock data clearly and help answer important questions about the underlying data.

Guided Project: Preparing Data with Excel

In this guided project, you’ll import a dataset, organize it, clean it, and then consolidate it into a single table, preparing it for analysis.

Guided Project: Building a BI App

Build a business intelligence app in Power BI that visualizes course quality and completion data for Dataquest courses.

Guided Project: Visualization of Life Expectancy and GDP Variation Over Time

Apply what you’ve learned to a real-world database.

Guided Project: Modeling Data in Power BI

Apply data cleaning and data model building to real-world scenarios

Guided Project: Creating a Kaggle Workflow

Learn how to create and use a machine learning workflow with Kaggle’s ‘Titanic’ Competition

Guided Project: Predicting the stock market

Use machine learning techniques to predict the price of the SP500.

Guided Project: Predicting House Sale Prices

Practice building and improving linear regression models

Guided Project: Predicting Car Prices

Practice the machine learning workflow using k-nearest neighbors to predict car prices.

Guided Project: Customers and Products Analysis Using SQL

Work on a real-life project using SQL

Guided Project: Creating a Project Portfolio

Create a Shiny app to showcase your personal data projects

Guided Project - Predicting Condominium Sale Prices

Combine the skills you learned in this course to perform linear regression modeling of condominium sale price on the basis of size.

Guided Project: Winning Jeopardy

Learn how to analyze text while figuring out strategies to win at Jeopardy.

Guided Project: Building a Spam Filter with Naive Bayes

Learn to use conditional probability and Naive Bayes in a practical setting.

Guided Project: Mobile App for Lottery Addiction

Learn to use probability and combinatorics in a practical setting.

Guided Project: Finding the Best Markets to Advertise In

Learn to combine the skills you learned in this course to perform practical data analysis.

Guided Project: Investigating Fandango Movie Ratings

Learn to combine the skills you learned in this course to perform practical data analysis.

Guided Project: Analyzing Movie Ratings

Learn to combine the skills you learned in this course to perform practical web data extraction.

Guided Project: Answering Business Questions using SQL

Practice the SQL data analysis workflow.

Guided Project: Designing and Creating a Database

Learn how to design, create, and populate a normalized SQL database.

Guided Project: New York Solar Resource Data

Learn to combine the skills you learned in this course to perform practical data extraction.

Guided Project: NYC Schools Perceptions

Practice your data cleaning and analysis skills and learn to use R Notebooks as you explore survey data.

Guided Project: Analyzing Forest Fire Data

Use data visualization techniques to explore data on forest fires.

Guided Project: Creating An Efficient Data Analysis Workflow, Part 2

Employ even more programming techniques to create a reusable data workflow.

Guided Project: Creating An Efficient Data Analysis Workflow

Apply control flow, loops and functions to create a reusable data workflow.

Guided Project: Investigating COVID-19 Virus Trends

Learn to combine the skills you learned in this course to perform practical data analysis.

Project: Install RStudio

Learn how to install and use RStudio, a free and open-source development environment for R.

Guided Project: Hacker News Pipeline

Learn how to use a data pipeline to summarize hacker news data.

Guided Project: Implementing a Key-Value Database

Use a B-Tree to implement a key-value datastore in Python.

Guided Project: Analyzing Stock Prices

Use the correct data structures to analyze stock prices efficiently

Guided Project: Evaluating Numerical Expressions

Use stacks to evaluate numerical expressions.

Guided Project: Analyzing Wikipedia Pages

Use MapReduce to analyze Wikipedia pages more quickly.

Guided Project: Practice Optimizing DataFrames and Processing in Chunks

Practice optimizing DataFrame types and working in chunks.

Guided Project: Analyzing Startup Fundraising Deals from Crunchbase

Practice analyzing data using the pandas SQLite workflow.

Project: PostgreSQL Installation

Learn how to install PostgreSQL and the Psycopg2 library.

Guided Project: Building Fast Queries on a CSV

Apply what you have learned to implement an inventory system for a laptop store with efficient queries.

Guided Project: Exploring Hacker News Posts

Practice using loops, cleaning strings, and working with dates in python.

Project: Learn and Install Jupyter Notebook

Learn the basics of Jupyter Notebook

Guided Project: Profitable App Profiles for the App Store and Google Play Markets

Learn to combine the skills you learned in this course to perform practical data analysis.

Project: Spark Installation and Jupyter Notebook Integration

Learn how to set up PySpark and integrate it with Jupyter Notebook.

Guided Project: Building A Handwritten Digits Classifier

Learn the basics of image classification to build a handwriting classifier.

Guided Project: Optimizing Model Prediction

In this guided project, we will predict the damage done by forest fires and improve upon an existing model. 

Guided Project: Predicting Employee Productivity Using Tree Models

In this project, using the “Productivity Prediction of Garment Employees” dataset from the UCI Machine Learning Repository, we’ll determine the best working conditions to reach the expected productivity thresholds in a garment factory.

Guided Project: Classifying Heart Disease

In this guided project, you will practice the machine learning workflow and practice creating and optimizing a logistic regression to detect heart disease. 

Guided Project: Stochastic Gradient Descent on Linear Regression

In this project, you will load, explore, and prepare a dataset to build a stochastic gradient descent regression model (linear regression), and then you will measure the efficiency of the model and visualize the results.

Guided Project: Predicting Insurance Costs

In this guided project, practice linear regression modeling and evaluation.

Guided Project: Credit Card Customer Segmentation

For this project, we’ll build a clustering model to segment credit card customers into different groups in order to apply different solutions for each type of customer. 

Guided Project: Predicting Heart Disease

Build a K Nearest Neighbors classifier to predict whether patients might be at risk of heart disease.

Project: Git Installation and GitHub Integration

Learn how to install Git and authenticate with GitHub.

Guided Project: Popular Data Science Questions

Apply the skills you’ve learned to determine the creation of data science content.

Guided Project: Analyzing CIA Factbook Data Using SQL

Practice using SQL to analyze CIA Factbook data.

Guided Project: Analyzing NYC High School Data

Perform additional analysis on a clean dataset with maps and visualizations.

Guided Project: Star Wars Survey

Use survey data to better understand Star Wars fans.

Guided Project: Clean and Analyze Employee Exit Surveys

Practice data cleaning using pandas.

Guided Project: Storytelling Data Visualization on Exchange Rates

Combine the skills you’ve learned to create a storytelling data visualization.

Guided Project: Finding Heavy Traffic Indicators on I-94

Learn to analyze data using exploratory data visualization.

Guided Project: Exploring eBay Car Sales Data

Practice data cleaning and data exploration using pandas

Guided Project: Prison Break

Learn the basics of Jupyter Notebook by analyzing a dataset on helicopter prison escapes