Data cleaning might not be the reason you got interested in data science, but if you’re going to be a data scientist, no skill is more crucial. Learn how to clean data with Python and pandas in our new course.
Ahoy, mateys! Happy International Talk Like A Pirate Day! You may think a pirate’s life sounds like fun, but it isn’t all buried treasure and singing yo-ho-ho. Pirates have lots to do: Predicting profitability of plundering events based on crew size and ship features Optimizing trade routes to avoid the law, storms, and other pirates […]
In celebration of Women’s History Month, I wanted to better understand the scale of the Women’s Marches that occurred in January 2017. Shortly after the marches, Vox published a map visualizing the estimated turnout across the entire country. This map is excellent at displaying: locations with the highest relative turnouts hubs and clusters of where […]
Cleaning and preparing data is a critical first step in any machine learning project. In this blog post, Dataquest student Daniel Osei takes us through examining a dataset, selecting columns for features, exploring the data visually and then encoding the features for machine learning. After first reading about Machine Learning on Quora in 2015, Daniel […]
This is the fifth post in a series of posts on how to build a Data Science Portfolio. You can find links to the other individual posts in this series at the bottom of the post. If you’ve ever worked on a personal data science project, you’ve probably spent a lot of time browsing the […]
The Museum of Modern Art is one of the most influential museums in the world and they have released a dataset on the artworks in their collection. The dataset has some data quality issues, however, and requires cleanup. In a previous post, we discussed how we used Python and Pandas to clean the dataset. In […]
Art is a messy business. Over centuries, artists have created everything from simple paintings to complex sculptures, and art historians have been cataloging everything they can along the way. The Museum of Modern Art, or MoMA for short, is considered one of the most influential museums in the world and recently released a dataset of […]