Learn text classification using linear regression in Python using the spaCy package in this free machine learning tutorial.
Deep learning is a type of machine learning that’s growing at an almost frightening pace. Nearly every projection has the deep learning industry expanding massively over the next decade. This market research report, for example, expects deep learning to grow 71x in the US and more than that globally over the next ten years. There’s […]
Error metrics are short and useful summaries of the quality of our data. We dive into four common regression metrics and discuss their use cases.
Getting into Machine Learning and AI is not an easy task, but is a critical part of data science programs. Many aspiring professionals and enthusiasts find it hard to establish a proper path into the field, given the enormous amount of resources available today. The field is evolving constantly and it is crucial that we […]
Machine learning algorithms are key for anyone who’s interested in the data science field. Here’s an introduction to ten of the most fundamental ML algorithms.
Women are underrepresented in STEM fields – science, technology, engineering, and math. For instance, women made up 27% of people employed in computer and mathematical occupations in 1960. But instead of growing over several decades, as many more women participated in the workforce overall, that number had declined to 26% by 2013, according to a […]
This Python data science tutorial uses a real-world data set to teach you how to diagnose and reduce bias and variance in machine learning.
Kaggle is a site where people create algorithms and compete against machine learning practitioners around the world. Your algorithm wins the competition if it’s the most accurate on a particular data set. Kaggle is a fun way to practice your machine learning skills. This tutorial is based on part of our free, four-part course: Kaggle […]
Machine learning is easily one of the biggest buzzwords in tech right now. Over the past three years, Google searches for “machine learning” have increased by over 350%. But understanding machine learning can be difficult — you either use pre-built packages that act like ‘black boxes’ where you pass in data and magic comes out […]
Cleaning and preparing data is a critical first step in any machine learning project. In this blog post, Dataquest student Daniel Osei takes us through examining a dataset, selecting columns for features, exploring the data visually and then encoding the features for machine learning. After first reading about Machine Learning on Quora in 2015, Daniel […]
A collection of the best places to find free data sets for data visualization, data cleaning, machine learning, and data processing projects.
Learn how to build an end to end machine learning project — a key part of any data science portfolio — in this free tutorial walkthrough.
Python is becoming an increasingly popular language for data science, and with good reason. It’s easy to learn, has powerful data science libraries, and integrates well with databases and tools like Hadoop and Spark. With Python, we can perform the full lifecycle of data science projects, including reading data in, analyzing data, visualizing data, and […]
In this tutorial, we’ll guide you through the basic principles of machine learning, and how to get started with machine learning with Python.
Learn to predict sentiment in movie reviews with machine learning using naive bayes classifiction, Python, and scikit-learn.