Automation & Scripting
Python Guides
Python is the glue of modern data engineering — used for cleaning data, scripting pipelines, and orchestrating workflows.
Topic Areas
What You'll Learn
Python guides organized by concept, focused on real data engineering tasks.
🧵
Pandas Basics
DataFrames, series, and core data manipulation operations.
🧹
Data Cleaning
Handling missing values, duplicates, and type conversions.
📋
File & API Ingestion
Reading CSVs, JSON, and calling APIs to pull in data.
⚙
Automation Scripts
Writing reusable scripts for repeatable data tasks.
🔄
Working with Airflow
Using Python to define DAGs and orchestrate pipelines.
🔐
Testing & Validation
Writing simple checks to catch bad data early.
Start Here
Featured Python Tutorial
The most popular starting point for new Python learners.