Tag: Pandas

Dataquest’s Philosophy: Building the Perfect Data Science Learning Tool

Learn how Dataquest’s philosophy sets our platform apart from other data science learning tools, and what we’ve learned from years of teaching data science.

Read More

Tutorial: Advanced For Loops in Python

If you’ve already mastered the basics of iterating through Python lists, take it to the next level and learn to use for loops in pandas, numpy, and more!

Read More

Python Machine Learning Tutorial: Predicting Airbnb Prices

Learn about machine learning in Python and build your very first ML model from scratch to predict Airbnb prices using k-nearest neighbors.

Read More

Data Cleaning and Preparation for Machine Learning

Learn data cleaning for a machine learning project by cleaning and preparing loan data from LendingClub for a predictive analytics project.

Read More

Tutorial: Text Analysis in Python to Test a Hypothesis

Learn to do some text analysis in this Python tutorial, and test hypotheses using confidence intervals to insure your conclusions are significant.

Read More

Tutorial: Text Classification in Python Using spaCy

Learn text classification using linear regression in Python using the spaCy package in this free machine learning tutorial.

Read More

New Course: Learn Data Cleaning with Python and Pandas

Data cleaning might not be the reason you got interested in data science, but if you’re going to be a data scientist, no skill is more crucial. Learn how to clean data with Python and pandas in our new course.

Read More

Tutorial: Advanced Jupyter Notebooks

you’re doing data science in Python, notebooks are a powerful tool. This free Jupyter Notebooks tutorial has will help you get the best out of Jupyter.

Read More

Data Science Portfolio Project: Where to Advertise an E-learning Product

At Dataquest, we strongly advocate portfolio projects as a means of getting a first data science job. In this blog post, we’ll walk you through an example portfolio project. The project is part of our Statistics Intermediate: Averages and Variability course, and it assumes familiarity with: Sampling (populations, samples, sample representativity) Frequency distributions Box plots […]

Read More

Data Science Portfolio Project: Is Fandango Still Inflating Ratings?

At Dataquest, we strongly advocate portfolio projects as a means of getting your first data science job. In this blog post, we’ll walk you through an example portfolio project. The project is part of our Statistics Fundamentals course, and it assumes some familiarity with: Sampling (simple random sampling, populations, samples, parameters, statistics) Variables Frequency distributions […]

Read More

Visualizing Women’s Marches: Part 1

In celebration of Women’s History Month, I wanted to better understand the scale of the Women’s Marches that occurred in January 2017. Shortly after the marches, Vox published a map visualizing the estimated turnout across the entire country. This map is excellent at displaying: locations with the highest relative turnouts hubs and clusters of where […]

Read More

Adding Axis Labels to Plots With pandas

Pandas plotting methods provide an easy way to plot pandas objects. Often though, you’d like to add axis labels, which involves understanding the intricacies of Matplotlib syntax. Thankfully, there’s a way to do this entirely using pandas. Let’s start by importing the required libraries: import pandas as pd import numpy as np import matplotlib.pyplot as […]

Read More

Tutorial: Concatenation (Combining Data Tables) with Pandas and Python

In this tutorial, we walk through several methods of combining data tables (concatenation) using pandas and Python, working with labor market data.

Read More

Tutorial Using Excel with Python and Pandas

In this tutorial, we’ll learn to work with Excel files in Python using pandas — everything from setting up your computer to moving and visualizing data.

Read More

Kaggle Fundamentals: The Titanic Competition

Kaggle is a site where people create algorithms and compete against machine learning practitioners around the world. Your algorithm wins the competition if it’s the most accurate on a particular data set. Kaggle is a fun way to practice your machine learning skills. This tutorial is based on part of our free, four-part course: Kaggle […]

Read More