Tag Archives for " Data Science "

How to Use Dataquest

Dataquest’s learning platform is user-friendly enough that if you’d like to, you can simply dive right in. But if you’re the type who likes to flip through the user manual first, this article is for you. In it, we’re going to cover the basic features of the Dataquest platform, and pass along some helpful tips […]

An Intro to Deep Learning in Python

Deep learning is a type of machine learning that’s growing at an almost frightening pace. Nearly every projection has the deep learning industry expanding massively over the next decade. This market research report, for example, expects deep learning to grow 71x in the US and more than that globally over the next ten years. There’s […]

Learn to do Data Viz in R

One of the reasons that R is a top language for data science is that it’s great for data visualization. R users can take advantage of the wildly popular ggplot2 package to turn massive data sets into easily-readable charts in just a few lines of code. That can be incredibly valuable for presenting your data, […]

3 Mighty Good Reasons to Learn R for Data Science

Ahoy, mateys! Happy International Talk Like A Pirate Day! You may think a pirate’s life sounds like fun, but it isn’t all buried treasure and singing yo-ho-ho. Pirates have lots to do: Predicting profitability of plundering events based on crew size and ship features Optimizing trade routes to avoid the law, storms, and other pirates […]

DIY AI for the Future

Editor’s note: This post is the result of a collaboration with PredictX, a decision automation platform. Author Joni Lindes is a content writer at PredictX. AI is set to disrupt our current society on a major scale. According to Indeed, the number of roles in AI has risen by 485% in the UK since 2014, […]

Programming Best Practices For Data Science

The data science life cycle is generally comprised of the following components: data retrieval data cleaning data exploration and visualization statistical or predictive modeling While these components are helpful for understanding the different phases, they don’t help us think about our programming workflow. Often, the entire data science life cycle ends up as an arbitrary […]

Data Retrieval and Cleaning: Tracking Migratory Patterns

Advancing your skills is an important part of being a data scientist. When starting out, you mostly focus on learning a programming language, proper use of third party tools, displaying visualizations, and the theoretical understanding of statistical algorithms. The next step is to test your skills on more difficult data sets. Sometimes these data sets […]

Learning From Bank Data: Women Across the World

This post looks at the World Bank World Development Indicators (WDI). This massive collection has data in several categories: demographic, education, work, poverty, health. It includes both country-level data and various aggregates by different criteria: geographical regions, income levels, etc. The UK Data Service has a useful guide as well as access to the data. […]

Exploring Women’s Army Auxiliary Corps Data

Today I want to go on an excursion in “catalogues as data“. The UK National Archives’ Discovery catalogue is an excellent resource for this activity, because a) it has a lot of records that have document descriptions at ‘item’ or ‘piece’ level in the catalogue, containing quite structured information (like dates, places, occupations) that can […]

Data Science Terms and Jargon: A Glossary

Getting started in data science can be overwhelming, especially when you consider the variety of concepts and techniques a data scienctist needs to master in order to do her job effectively. Even the term “data science” can be somewhat nebulous, and as the field gains popularity it seems to lose definition. To help those new […]

10 Data Science Projects You Can Join Today

Editor’s note: This post was written as part of a collaboration with data.world, a site for sharing and hosting data. Authors Shannon Peifer and Gabriela Swider are on the data.world team. Finding the right data can be difficult. And even once you have it, how do you collaborate with others to make sense of it? […]

How to Write a Bootcamp Review that Actually Helps People

Editor’s note: This post was written as part of a collaboration with SwitchUp, an online platform for researching and reviewing technology learning programs. Erica Freedman is a Content and Client Services Specialist at SwitchUp. Data Science is a rapidly growing industry. From university programs to week-long cohorts, it can be difficult to decide where to […]

Introduction to Python Ensembles

Stacking models in Python efficiently Ensembles have rapidly become one of the hottest and most popular methods in applied machine learning. Virtually every winning Kaggle solution features them, and many data science pipelines have ensembles in them. Put simply, ensembles combine predictions from different models to generate a final prediction, and the more models we […]

1 2 3