Avatar
Josh Devlin
Author Archives: Josh Devlin

Adding Axis Labels to Plots With pandas

Pandas plotting methods provide an easy way to plot pandas objects. Often though, you’d like to add axis labels, which involves understanding the intricacies of Matplotlib syntax. Thankfully, there’s a way to do this entirely using pandas. Let’s start by importing the required libraries: import pandas as pd import numpy as np import matplotlib.pyplot as […]

Kaggle Fundamentals: The Titanic Competition

Kaggle is a site where people create algorithms and compete against machine learning practitioners around the world. Your algorithm wins the competition if it’s the most accurate on a particular data set. Kaggle is a fun way to practice your machine learning skills. This tutorial is based on part of our free, four-part course: Kaggle […]

Python Cheat Sheet for Data Science: Intermediate

The printable version of this cheat sheet The tough thing about learning data is remembering all the syntax. While at Dataquest we advocate getting used to consulting the Python documentation, sometimes it’s nice to have a handy reference, so we’ve put together this cheat sheet to help you out! This cheat sheet is the companion […]

How to Get Your First Job as a Data Scientist.

Many aspiring data scientists focus on doing Kaggle competitions as a way to build their portfolios. Kaggle is an excellent way to practice, but it should only be one of many avenues you use to work on data science projects. This is because Kaggle competitions only focus on a narrow part of data science work. […]

Introducing our new Interface

Our new mission design has arrived! Over the past few months we’ve been tirelessly talking to students like you to learn how we can improve the mission interface. Today we are unveiling the results of this hard work. Since a lot has changed, we wanted to take a moment to describe the big changes and […]

Python Cheat Sheet for Data Science: Basics

It’s common when first learning Python for Data Science to have trouble remembering all the syntax that you need. While at Dataquest we advocate getting used to consulting the Python documentation, sometimes it’s nice to have a handy reference, so we’ve put together this cheat sheet to help you out! This cheat sheet is the […]

Should I Learn Python 2 or 3?

One of the biggest sources of confusion and misinformation for people wanting to learn Python is which version they should learn. Should I learn Python 2.x or Python 3.x? Indeed, this is one of the questions we are asked most often at Dataquest, where we teach Python as part of our Data Science curriculum. This […]

Harry: “Dataquest helped me start my career in data”

While working as a geophysicist for an oil services company, Harry Robinson found himself interested in data. “My job involved lots of data, but it was always at arms length. We were applying algorithms, but I never got to see them. “I wanted to know what was happening and why, so I could interpret the results.” He decided […]

NumPy Cheat Sheet — Python for Data Science

NumPy is the library that gives Python its ability to work with data at speed. Originally, launched in 1995 as ‘Numeric,’ NumPy is the foundation on which many important Python data science libraries are built, including Pandas, SciPy and scikit-learn. It’s common when first learning NumPy to have trouble remembering all the functions and methods […]

Kyle: “Dataquest helped me get into the tech industry”

For the first four years of his career, Kyle Stewart worked as a product manager in industrial automation. “I was working for a fortune 500 company. I managed products that helped industrial processes, like at an oil refinery.” He wanted to move into the more dynamic tech industry. “In industrial product management it’s difficult to make […]