Learn data cleaning for machine learning by preparing loan data from LendingClub for a predictive analytics project.
Learn to clean data and replace missing values in Python using techniques such as regular expressions (regex), list comprehensions, and lambda functions.
A collection of the best places to find free data sets for data visualization, data cleaning, machine learning, and data processing projects.
Data cleaning might not be the reason you got interested in data science, but if you’re going to be a data scientist, no skill is more crucial. Learn how to clean data with Python and pandas in our new course.
Ahoy, mateys! Happy International Talk Like A Pirate Day! You may think a pirate’s life sounds like fun, but it isn’t all buried treasure and singing yo-ho-ho. Pirates have lots to do: predicting profitability of plundering events based on crew size and ship features; optimizing trade routes to avoid the law, storms, and other pirates […]
In celebration of Women’s History Month, I wanted to better understand the scale of the Women’s Marches that occurred in January 2017. Shortly after the marches, Vox published a map visualizing the estimated turnout across the entire country. This map is excellent at displaying: locations with the highest relative turnouts; hubs and clusters of where […]
Learn how to clean data on the command line, a key skill for doing data analysis and data science, using Python and csvkit.
A step-by-step tutorial on data cleaning (or data munging), a core data science skill, using a dataset from the MoMA and Python's pandas library.