Introduction to AWS for Data Scientists

These days, many businesses use cloud-based services; as a result, various companies have started building and providing such services. Amazon began the trend with Amazon Web Services (AWS). While AWS began in 2006 as a side business, it now makes $14.5 billion in revenue each year. Other leaders in this area include Google, with Google Cloud […]

How to Write a Bootcamp Review that Actually Helps People

Editor’s note: This post was written as part of a collaboration with SwitchUp, an online platform for researching and reviewing technology learning programs. Erica Freedman is a Content and Client Services Specialist at SwitchUp. Data Science is a rapidly growing industry. From university programs to week-long cohorts, it can be difficult to decide where to […]

Write for Dataquest

UPDATE: This program has changed. For the latest information on how to write for Dataquest, please refer to this article. The Dataquest blog reaches over 100,000 readers each month, so this is an opportunity for you to get your work seen and grow your platform. You don’t have to be a professional writer […]

Introduction to Python Ensembles

Stacking models in Python efficiently. Ensembles have rapidly become one of the hottest and most popular methods in applied machine learning. Virtually every winning Kaggle solution features them, and many data science pipelines have ensembles in them. Put simply, ensembles combine predictions from different models to generate a final prediction, and the more models we […]

Postgres Internals: Building a Description Tool

In previous blog posts, we have described the Postgres database and ways to interact with it using Python. Those posts provided the basics, but if you want to work with databases in production systems, then it is necessary to know how to make your queries faster and more efficient. To understand what efficiency means in […]

Priya: “Dataquest helped me to help others”

Priya Iyer decided to learn data science so that she could better help people. Her startup, Tulalens, operated for two years and raised $100k towards helping women in urban slums. Tulalens helped the women launch small businesses that sold iron-rich foods, and shared information on iron-deficiency anemia. Priya and her partners realized that they could be […]

Luiz: “Dataquest helped me learn on my own schedule”

Luiz Zanini was working as a Mechatronics Engineer when he decided he needed a radical career change. He was frustrated by corporate life, and knew he’d be happier as a programmer. When researching new career paths, Data Scientist stood out—he really liked Python and dreamt of a digital nomad lifestyle. “I made a list of the […]

Adding Axis Labels to Plots With pandas

Pandas plotting methods provide an easy way to plot pandas objects. Often though, you’d like to add axis labels, which involves understanding the intricacies of Matplotlib syntax. Thankfully, there’s a way to do this entirely using pandas. Let’s start by importing the required libraries: import pandas as pd import numpy as np import matplotlib.pyplot as […]

Setting Up the PyData Stack on Windows

The speed of modern electronic devices allows us to crunch large amounts of data at home. However, these devices require the right software in order to reach peak performance. Luckily, it’s now easier than ever to set up your own data science environment. One of the most popular stacks for data science is PyData, a […]

How to Start a Data Science Meetup

Meetups are great tools: you’re able to meet people in the field, keep up on industry news, and learn how to ‘talk the talk.’ Before I started attending meetups, I wasn’t aware of just how much I didn’t know and still had to learn, let alone what was missing in how I wrote code and […]

Kaggle Fundamentals: The Titanic Competition

Kaggle is a site where people create algorithms and compete against machine learning practitioners around the world. Your algorithm wins the competition if it’s the most accurate on a particular data set. Kaggle is a fun way to practice your machine learning skills. This tutorial is based on part of our free, four-part course: Kaggle […]

Five Essential Traits of a Data Scientist

Trillions of pixels have been deployed to answer the question ‘What makes a good data scientist?’ Most of these articles have focused on skills and tools of data science while almost none have discussed the personalities that make good, even great, data scientists. A Google search for “data science skills” returns 38 million results; ‘data […]

SQL Fundamentals

The pandas workflow is a common favorite among data analysts and data scientists. The workflow looks something like this: The pandas workflow works well when: the data fits in memory (a few gigabytes but not terabytes); the data is relatively static (doesn’t need to be loaded into memory every minute because the data has changed); […]
