The Dataquest Download

Level up your data and AI skills, one newsletter at a time.

Each week, the Dataquest Download brings the latest behind-the-scenes developments at Dataquest directly to your inbox. Discover our top tutorial of the week to boost your data skills, get the scoop on any course changes, and pick up a useful tip to apply in your projects. We also spotlight standout projects from our students and share their personal learning journeys.

Hello, Dataquesters!

Here’s what we have in store for you in this edition:

Top Read: Master prompt engineering by crafting clear, effective prompts to get better results from AI tools.

From the Community: Confused about when to use GitHub Repositories vs. Gists? Discover the key differences, strengths, and use cases to help you choose the right tool for your next project.

New Resource: Build a machine learning model to predict heart disease risk, covering data cleaning, feature selection, and model building.

Introduction to Prompt Engineering for Data Professionals

AI tools like ChatGPT can supercharge your data work, but you need to know how to communicate with them. In this tutorial, you’ll discover how to craft precise, strategic prompts that turn vague AI outputs into clear, actionable insights. Learn techniques to save time, boost accuracy, and make AI a reliable partner in your data workflow.

From the Community

Heart Disease Prediction Kaggle Competition Winners: Pastor, the competition’s organizer, has announced the three lucky winners. Find out their names and be ready for the next competition!

Hacker News Pipeline: Ramesh has created a reproducible pipeline to identify the top 100 keywords on Hacker News in 2014. The project is brief and to the point because it relies on a @pipeline decorator, which let Ramesh split complicated work into small, easily adjustable chunks.

Python Variable Naming Convention: Raisa gives an exhaustive explanation of variable naming best practices in Python and why they’re important.

Introduction to Python Dictionaries: Raisa provides a detailed introduction to Python dictionaries with plenty of examples, illustrating the theory with a use case.

GitHub repositories vs. Gists: Anna provides a quick but helpful comparison of GitHub repositories and Gists, outlining the strengths and weaknesses of both.
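
For readers curious about the decorator idea mentioned in the Hacker News Pipeline item above, here is a minimal sketch of how a decorator can register small functions as pipeline steps. It is not Ramesh's code: the Pipeline class, its task method, the step functions, and the sample titles are all hypothetical and only illustrate the general pattern.

```python
from collections import Counter


class Pipeline:
    """Hypothetical minimal pipeline: tasks run in the order they are registered."""

    def __init__(self):
        self.tasks = []

    def task(self, func):
        # Register the function as the next step and return it unchanged.
        self.tasks.append(func)
        return func

    def run(self, data):
        # Feed each step's output into the next step.
        for func in self.tasks:
            data = func(data)
        return data


pipeline = Pipeline()


@pipeline.task
def lowercase(titles):
    return [title.lower() for title in titles]


@pipeline.task
def split_words(titles):
    return [word for title in titles for word in title.split()]


@pipeline.task
def count_words(words):
    # Keep the two most frequent words.
    return Counter(words).most_common(2)


# Made-up titles, used only to exercise the sketch.
top_words = pipeline.run(["Show HN: Python tips", "Ask HN: Python or Go"])
print(top_words)
```

Because each step is an ordinary function, one step can be swapped or adjusted without touching the others, which is the benefit the project description points to.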

DQ Resources

Build a Machine Learning Model: Learn to analyze patient data and build a machine learning model that predicts heart disease risk. This project covers data cleaning, exploratory analysis, feature selection, and model building, all essential skills for aspiring data professionals.

Build a Python Word Guessing Game: Learn to create a Wordle-style word guessing game in Python. It's a fun way to practice programming fundamentals like loops, logic, and object-oriented design, and it's great for beginners looking to build interactive projects.

Analyzing Kaggle Data Science Survey: Practice core Python skills like lists, loops, and conditionals while exploring trends in programming languages and compensation among data scientists.

What We're Reading

Dealing with Highly Skewed Data: A Practical Guide: Learn how to identify and transform skewed data for better analysis in your data science projects.

AI Agents Try Running a Company—Here’s What Happened: An experiment conducted at Carnegie Mellon University tested AI agents managing a company, leading to missed deadlines, strange hacks, and a revealing look at AI’s limitations.
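
As a small illustration of the skewed-data topic above, the sketch below measures skewness before and after a log transform, using only the standard library. The log transform is just one common option and may not be the method the linked guide recommends. The skewness helper and the sample values are made up for illustration.

```python
import math
import statistics


def skewness(values):
    """Population skewness: the mean of the cubed z-scores."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    return sum(((v - mean) / std) ** 3 for v in values) / len(values)


# Made-up, right-skewed sample (not real data): most values are small, a few are large.
incomes = [20, 22, 25, 27, 30, 32, 35, 40, 120, 400]

# log1p(x) = log(1 + x) compresses the long right tail and also accepts zeros.
log_incomes = [math.log1p(v) for v in incomes]

raw_skew = skewness(incomes)
log_skew = skewness(log_incomes)

print(f"skewness before: {raw_skew:.2f}, after log1p: {log_skew:.2f}")
```

A value near zero suggests a roughly symmetric distribution. Here the transform reduces the skew but does not remove it.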

Give 20%, Get $20: Time to Refer a Friend!

Now is the perfect time to share Dataquest with a friend. Gift a 20% discount, and earn a $20 bonus for every friend who subscribes. Redeem your bonuses for digital gift cards or prepaid cards, or donate them to charity. Your choice!

High-fives from Vik, Celeste, Anna P, Anna S, Anishta, Bruno, Elena, Mike, Daniel, and Brayan.
