The Modern Data Stack
Tools
An educational overview of the categories of tools used across data engineering — explained conceptually, without vendor pricing or sign-ups.
Tool Categories
The Categories Behind Every Data Stack
Understand what each category of tool does before comparing specific products.
💾
Databases
Systems for storing and querying structured or unstructured data.
📊
Data Warehouses
Analytical stores optimized for large-scale reporting queries.
🔄
Orchestration
Schedulers that coordinate pipeline tasks and dependencies.
🧩
Transformation
Frameworks for modeling and transforming data with SQL.
⚡
Processing Engines
Distributed compute engines like Apache Spark for large datasets.
☁
Cloud Storage
Durable object storage for data lakes and raw file archives.
🔐
Data Quality
Tools and frameworks for validating and monitoring data.
📈
BI & Visualization
Dashboards and reporting tools built on top of warehouses.