Category Archives for "Data Science Projects"

Data Science Portfolio Project: Where to Advertise an E-learning Product

At Dataquest, we strongly advocate portfolio projects as a means of getting a first data science job. In this blog post, we’ll walk you through an example portfolio project. The project is part of our Statistics Intermediate: Averages and Variability course, and it assumes familiarity with: Sampling (populations, samples, sample representativity) Frequency distributions Box plots […]

Data Science Portfolio Project: Is Fandango Still Inflating Ratings?

At Dataquest, we strongly advocate portfolio projects as a means of getting your first data science job. In this blog post, we’ll walk you through an example portfolio project. The project is part of our Statistics Fundamentals course, and it assumes some familiarity with: Sampling (simple random sampling, populations, samples, parameters, statistics) Variables Frequency distributions […]

A Data Science Project Style Guide

Employers usually give a lot of weight to a candidate’s portfolio when hiring for a junior data science role. Although you may be capable of technically impressive projects, your job hunt will suffer if you don’t pay enough attention to the stylistic aspects as well. A busy employer is not going to review poorly constructed […]

Generating Climate Temperature Spirals in Python

Ed Hawkins, a climate scientist, tweeted the following animated visualization in 2017 and captivated the world: This visualization shows the deviations from the average temperature between 1850 and 1900. It was reshared millions of times over Twitter and Facebook and a version of it was even shown at the opening ceremony for the Rio Olympics. […]

5 Ways to Find Interesting Data Sets

Editor’s note: This post was written as part of a collaboration with Enigma, a public data company. Author India Kerle is a data curator at Enigma. There are a canon of open datasets used widely in data science projects — you’ve likely come across something making use of the Iris Flower classic or New York’s […]

10 Data Science Projects You Can Join Today

Editor’s note: This post was written as part of a collaboration with data.world, a site for sharing and hosting data. Authors Shannon Peifer and Gabriela Swider are on the data.world team. Finding the right data can be difficult. And even once you have it, how do you collaborate with others to make sense of it? […]

Postgres Internals: Building a Description Tool

In previous blog posts, we have described the Postgres database and ways to interact with it using Python. Those posts provided the basics, but if you want to work with databases in production systems, then it is necessary to know how to make your queries faster and more efficient. To understand what efficiency means in […]