Tag Archives for " Data Science "
how-to-use-dataquest-feat

How to Use Dataquest

Dataquest’s learning platform is user-friendly enough that if you’d like to, you can simply dive right in. But if you’re the type who likes to flip through the user manual first, this article is for you. In it, we’re going to cover the basic features of the Dataquest platform, and pass along some helpful tips […]

An Intro to Deep Learning in Python

Deep learning is a type of machine learning that’s growing at an almost frightening pace. Nearly every projection has the deep learning industry expanding massively over the next decade. This market research report, for example, expects deep learning to grow 71x in the US and more than that globally over the next ten years. There’s […]

Learn to do Data Viz in R

One of the reasons that R is a top language for data science is that it’s great for data visualization. R users can take advantage of the wildly popular ggplot2 package to turn massive data sets into easily-readable charts in just a few lines of code. That can be incredibly valuable for presenting your data, […]

Python vs R: Head to Head Data Analysis

Which is better for data analysis? There have been dozens of articles written comparing Python and R from a subjective standpoint. This article aims to look at the languages more objectively. We’ll analyze a data set side by side in Python and R, and show what code is needed in both languages to achieve the […]

3 Mighty Good Reasons to Learn R for Data Science

Ahoy, mateys! Happy International Talk Like A Pirate Day! You may think a pirate’s life sounds like fun, but it isn’t all buried treasure and singing yo-ho-ho. Pirates have lots to do: Predicting profitability of plundering events based on crew size and ship features Optimizing trade routes to avoid the law, storms, and other pirates […]

DIY AI for the Future

Editor’s note: This post is the result of a collaboration with PredictX, a decision automation platform. Author Joni Lindes is a content writer at PredictX. AI is set to disrupt our current society on a major scale. According to Indeed, the number of roles in AI has risen by 485% in the UK since 2014, […]

Programming Best Practices For Data Science

The data science life cycle is generally comprised of the following components: data retrieval data cleaning data exploration and visualization statistical or predictive modeling While these components are helpful for understanding the different phases, they don’t help us think about our programming workflow. Often, the entire data science life cycle ends up as an arbitrary […]

Data Retrieval and Cleaning: Tracking Migratory Patterns

Advancing your skills is an important part of being a data scientist. When starting out, you mostly focus on learning a programming language, proper use of third party tools, displaying visualizations, and the theoretical understanding of statistical algorithms. The next step is to test your skills on more difficult data sets. Sometimes these data sets […]

Learning From Bank Data: Women Across the World

This post looks at the World Bank World Development Indicators (WDI). This massive collection has data in several categories: demographic, education, work, poverty, health. It includes both country-level data and various aggregates by different criteria: geographical regions, income levels, etc. The UK Data Service has a useful guide as well as access to the data. […]

Exploring Women’s Army Auxiliary Corps Data

Today I want to go on an excursion in “catalogues as data“. The UK National Archives’ Discovery catalogue is an excellent resource for this activity, because a) it has a lot of records that have document descriptions at ‘item’ or ‘piece’ level in the catalogue, containing quite structured information (like dates, places, occupations) that can […]

Data Science Terms and Jargon: A Glossary

Getting started in data science can be overwhelming, especially when you consider the variety of concepts and techniques a data scienctist needs to master in order to do her job effectively. Even the term “data science” can be somewhat nebulous, and as the field gains popularity it seems to lose definition. To help those new […]

10 Data Science Projects You Can Join Today

Editor’s note: This post was written as part of a collaboration with data.world, a site for sharing and hosting data. Authors Shannon Peifer and Gabriela Swider are on the data.world team. Finding the right data can be difficult. And even once you have it, how do you collaborate with others to make sense of it? […]

How to Write a Bootcamp Review that Actually Helps People

Editor’s note: This post was written as part of a collaboration with SwitchUp, an online platform for researching and reviewing technology learning programs. Erica Freedman is a Content and Client Services Specialist at SwitchUp. Data Science is a rapidly growing industry. From university programs to week-long cohorts, it can be difficult to decide where to […]

Want a Job in Data? Learn This.

Why mastering a 50-year-old programming language is the key to getting a data science job. SQL is old. There, I said it. I first heard about SQL in 1997. I was in high school, and as part of a computing class we were working with databases in Microsoft Access. The computers we used were outdated, […]

Introduction to Python Ensembles

Stacking models in Python efficiently Ensembles have rapidly become one of the hottest and most popular methods in applied machine learning. Virtually every winning Kaggle solution features them, and many data science pipelines have ensembles in them. Put simply, ensembles combine predictions from different models to generate a final prediction, and the more models we […]

Regular Expressions for Data Scientists

As data scientists, diving headlong into huge heaps of data is part of the mission. Sometimes, this includes massive corpuses of text. For instance, suppose we were asked to figure out who’s been emailing whom in the scandal of the Panama Papers — we’d be sifting through 11.5 million documents! We could do that manually […]

How to Start a Data Science Meetup

Meetups are great tools, you’re able to meet people in the field, keep up on industry news, and learn how to ‘talk the talk.’ Before I started attending meetups I wasn’t aware of just how much I didn’t know and still had to learn, let alone what was missing in how I wrote code and […]

Five Essential Traits of a Data Scientist

Trillions of pixels have been deployed to answer the question ‘What makes a good data scientist?’ Most of these articles have focused on skills and tools of data science while almost none have discussed the personalities that make good, even great, data scientists. A Google search for “data science skills” returns 38 million results; ‘data […]

SQL Fundamentals

The pandas workflow is a common favorite among data analysts and data scientists. The workflow looks something like this: The pandas workflow works well when: the data fits in memory (a few gigabytes but not terabytes) the data is relatively static (doesn’t need to be loaded into memory every minute because the data has changed) […]

How to Get Your First Job as a Data Scientist.

Many aspiring data scientists focus on doing Kaggle competitions as a way to build their portfolios. Kaggle is an excellent way to practice, but it should only be one of many avenues you use to work on data science projects. This is because Kaggle competitions only focus on a narrow part of data science work. […]

Should I Learn Python 2 or 3?

One of the biggest sources of confusion and misinformation for people wanting to learn Python is which version they should learn. Should I learn Python 2.x or Python 3.x? Indeed, this is one of the questions we are asked most often at Dataquest, where we teach Python as part of our Data Science curriculum. This […]

How to become a data scientist

Data science is one of the most buzzed about fields right now, and data scientists are in extreme demand. And with good reason — data scientists are doing everything from creating self-driving cars to automatically captioning images. Given all the interesting applications, it makes sense that data science is a very sought-after career. Data science […]

How to get a data science job

You’ve done it. You just spent months learning how to analyze data and make predictions. You’re now able to go from raw data to well structured insights in a matter of hours. After all that effort, you feel like it’s time to take the next step, and get your first data science job. Unfortunately for […]

Python for data science: Getting started

Python is becoming an increasingly popular language for data science, and with good reason. It’s easy to learn, has powerful data science libraries, and integrates well with databases and tools like Hadoop and Spark. With Python, we can perform the full lifecycle of data science projects, including reading data in, analyzing data, visualizing data, and […]

Share On Facebook
Share On Twitter
Share On Linkedin
Share On Reddit