Getting Started with Kaggle
- Approach a Kaggle competition
- Explore the competition data and learn about the competition topic
- Prepare data for machine learning
- Train a model
- Measure the accuracy of your model
- Prepare and make your first Kaggle sublesson.
This lesson assumes you have an understanding of Python and the pandas library. If you need to learn about these, we recommend going through our Python Fundamentals course and our Numpy and Pandas course.
In this lesson and lessons to follow, we’ll be working with RMS Titanic passenger data to predict which passengers survived the Titanic disaster. By the end of this lesson, you’ll have created and trained your first Kaggle machine learning model.
Kaggle is a site where people create algorithms and compete against machine learning practitioners around the world. Your algorithm wins the competition if it’s the most accurate on a particular data set. Kaggle is a fun way to practice your machine learning skills.
As you work through each concept, you’ll get to apply what you’ve learned from within your browser so that there’s no need to use your own machine to do the exercises. The Python environment inside of this course includes answer checking so you can ensure that you’ve fully mastered each concept before learning the next concept.
- Learn how to approach a Kaggle competition and explore the competition data.
- Learn techniques for cleaning and preparing data for machine learning.
- Learn how to train a machine learning model and make your first Kaggle sublesson.
- Introduction to Kaggle
- Exploring the Data
- Exploring and Converting the Age Column
- Preparing our Data for Machine Learning
- Creating Our First Machine Learning Model
- Splitting Our Training Data
- Making Predictions and Measuring their Accuracy
- Using Cross Validation for More Accurate Error Measurement
- Making Predictions on Unseen Data
- Creating a Sublesson File
- Making Our First Sublesson to Kaggle
- Next Steps