Category: Data Science Projects

Building a Recommender System with Netflix Data in R

Ok, so I finally got a chance to finish this three-part series. It’s been a long one, but better late than never. Since it’s been a while, I figured a quick recap is in order. To show how to approach an unguided data project, I decided to use some Netflix data to demonstrate the process […]

NLP Project Part 1: Scraping the Web to Gather Data

This is the first in a series of posts describing my natural language processing (NLP) project. To really benefit from this NLP article, you should understand the pandas library and know regex for cleaning data. We’ll also focus on web scraping, so elementary knowledge of HTML (the language used for creating websites) is very helpful, […]

Enormous Survey Reveals the Best Halloween Candies

Halloween is arguably the best holiday for those of us who like candy (which is everyone). I always loved Halloween because no matter how old you are, you probably want some type of candy. But, with the season only a few days away, you may be wondering two things: what costume will you wear, and […]

8 Rarely Used Python Libraries & How to Use Them

The most popular Python libraries are usually TensorFlow, NumPy, PyTorch, pandas, scikit-learn, Keras, and a few others. Although you may come across these names pretty frequently, there are thousands of Python libraries out there that you can work with. In this article, we are going to focus on how to use Python libraries […]

How I Scraped Over 25,000 Forum Posts In 3 Steps

Motivation: The Dataquest Community is evolving. In recent months, I’ve been watching the growth of active users, new topics, and areas of discussion. Over the past six months, the platform has done a lot to develop various tags that make it easier to filter and search for relevant information. But let’s try to […]

How to Plot a Bar Graph in Matplotlib: The Easy Way

A bar graph or bar chart is one of the most common visualization types and is very easy to create in Matplotlib. All we need to do is write one short line of Python code. However, if we want to create an informative, easily readable bar plot that efficiently reveals the story behind the data, […]

Comical Data Visualization in Python Using Matplotlib

Learn how to make comical visualizations in Matplotlib, explained using a Netflix movie and TV show dataset. Data visualization is a great way to tell a story: it lets you easily absorb information and identify patterns in data. One of our students decided to create a data visualization in Python using Matplotlib to understand the different types […]

45 Fun (and Unique) Python Project Ideas for Easy Learning

Building projects is an extremely successful way to learn, but building Python projects for beginners can be difficult. Learn how to build with success!

21 Places to Find Free Datasets for Data Science Projects

A collection of the best places to find free data sets for data visualization, data cleaning, machine learning, and data processing projects.

Tutorial: Find Dominant Colors in an Image through Clustering

Take the first step into image analysis in Python by using k-means clustering to analyze the dominant colors in an image in this free data science tutorial.
