A collection of the best places to find free data sets for data visualization, data cleaning, machine learning, and data processing projects.
Improve your data science communication skills and make your presentations more convincing by following these simple steps.
Ahoy, mateys! Happy International Talk Like A Pirate Day! You may think a pirate’s life sounds like fun, but it isn’t all buried treasure and singing yo-ho-ho. Pirates have lots to do: predicting profitability of plundering events based on crew size and ship features; optimizing trade routes to avoid the law, storms, and other pirates […]
This post was written by Lisa Charlotte Rost. Lisa is a designer at Datawrapper. Based in Berlin, she organizes the Data Vis meetup and enjoys the few sunny days there. This article was originally posted on Datawrapper, and has been reposted with permission. Data Visualisation can be defined as representing numbers with shapes – and […]
Ed Hawkins, a climate scientist, tweeted the following animated visualization in 2016 and captivated the world: This visualization shows the deviations from the average temperature between 1850 and 1900. It was reshared millions of times on Twitter and Facebook, and a version of it was even shown at the opening ceremony for the Rio Olympics. […]
Editor’s note: This post was written as part of a collaboration with iDataLabs, a marketing intelligence company. Author Julia Cook works in marketing at iDataLabs. We’re all familiar with data visualizations (word clouds, pie charts, pivot tables), but how does one put enquiries in paint? Patty Haller, a landscape artist from Seattle, WA, may have figured that […]
I’ve recently been working on the Digital Panopticon, a digital history project that has brought together (and created) massive amounts of data about British prisoners and convicts in the long 19th century, including several datasets that record women’s heights. Adult height is strongly influenced by environmental factors in childhood, one of the most important […]
This post looks at the World Bank World Development Indicators (WDI). This massive collection has data in several categories: demographic, education, work, poverty, health. It includes both country-level data and various aggregates by different criteria: geographical regions, income levels, etc. The UK Data Service has a useful guide as well as access to the data. […]
This post is the second in a series on visualizing the Women’s Marches from January 2017. In the first post, we explored the intensive data collection and data cleaning process necessary to produce clean pandas dataframes. Data Enrichment: Because we eventually want to be able to build maps visualizing the marches, we need latitude and […]
In celebration of Women’s History Month, I wanted to better understand the scale of the Women’s Marches that occurred in January 2017. Shortly after the marches, Vox published a map visualizing the estimated turnout across the entire country. This map is excellent at displaying: locations with the highest relative turnouts; hubs and clusters of where […]
If you read data science articles, you may have already stumbled upon FiveThirtyEight’s content. Naturally, you were impressed by their awesome visualizations. You wanted to make your own awesome visualizations and so asked Quora and Reddit how to do it. You received some answers, but they were rather vague. You still can’t get the graphs […]
Yes, you read correctly — this post will only give you 1 tip. I know most posts like this have 5 or more tips. I once saw a post with 15 tips, but I may have been daydreaming at the time. You’re probably wondering what makes this 1 tip so special. “Vik”, you may ask, […]
Learn how seven Python data visualization tools can be used together to perform exploratory data analysis and aid in data viz tasks.
In this tutorial, we’ll guide you through the basic principles of machine learning, and how to get started with machine learning with Python.