Tag Archives for " python "

Learning Python for Data Science the Right Way

Last week, we launched a totally revamped version of our introductory Python course. Now, we’re doing the same for our intermediate Python course. Say hello to Python for Data Science: Intermediate! The new course is available now, and just like the introductory Python course, it’s completely free. This course has been carefully designed to build […]

Learn Python the right way in 5 steps

Python is an amazingly versatile programming language. You can use it to build websites, machine learning algorithms, and even autonomous drones. A huge percentage of programmers in the world use Python, and for good reason. It gives you the power to create almost anything. But — and this is a big but — you have […]

digitalocean-docker

DigitalOcean and Docker for Data Science

Creating a cloud-based data science environment for faster analysis There are times when working on data science problems with your local machine just doesn’t cut it anymore. Maybe your computer is old, and can’t work with larger datasets. Or maybe you want to be able to access your work from anywhere, and collaborate with others. […]

Tutorial: An Introduction to Apache Spark

Overview After lots of ground-breaking work led by the UC Berkeley AMP Lab, Apache Spark was developed to utilize distributed, in-memory data structures to improve data processing speeds over Hadoop for most workloads. In this post, we’re going to cover the architecture of Spark and basic transformations and actions using a real dataset. If you […]

Tutorial: Introduction to Using APIs in Python

Application Program Interfaces, or APIs, are commonly used to retrieve data from remote websites. Sites like Reddit, Twitter, and Facebook all offer certain data through their APIs. To use an API, you make a request to a remote web server, and retrieve the data you need. But why use an API instead of a static […]

nba-basketball-python-knn-tutorial-k-nearest-neighbors

Tutorial: K Nearest Neighbors in Python

In this post, we’ll be using the K-nearest neighbors algorithm to predict how many points NBA players scored in the 2013-2014 season. Along the way, we’ll learn about euclidean distance and figure out which NBA players are the most similar to Lebron James. If you want to follow along, you can grab the dataset in […]

us-senators-python-data-tutorial

Tutorial: K-Means Clustering US Senators

Clustering is a powerful way to split up datasets into groups based on similarity. A very popular clustering algorithm is K-means clustering. In K-means clustering, we divide data up into a fixed number of clusters while trying to ensure that the items in each cluster are as similar as possible. In this post, we’ll explore […]

Share On Facebook
Share On Twitter
Share On Linkedin
Share On Reddit