Learn how to clean data on the command line, a key skill for doing data analysis and data science, using Python and csvkit.
Creating a cloud-based data science environment for faster analysis There are times when working on data science problems with your local machine just doesn’t cut it anymore. Maybe your computer is old, and can’t work with larger datasets. Or maybe you want to be able to access your work from anywhere, and collaborate with others. […]
Learn to set up a Docker data science environment using Docker containers and the popular Jupyter Notebook in this free tutorial.
Learn how seven Python data visualization tools can be used together to perform exploratory data analysis and aid in data viz tasks.
Here’s how to install PySpark on your computer and get started working with large data sets using Python and PySpark in a Jupyter Notebook.
In this tutorial, we’ll guide you through the basic principles of machine learning, and how to get started with machine learning with Python.
A step-by-step tutorial on data cleaning (or data munging, a core data science skill) a dataset from the MoMA with Python, using the Pandas module.
A tutorial that explores the Python counter class and uses the counter class for Probability Mass Functions and Bayesian Statistics.
Learn to predict sentiment in movie reviews with machine learning using naive bayes classifiction, Python, and scikit-learn.