Home Data Engineering ETL & ELT Data Pipelines Tutorials Blog Databases Data Warehousing Big Data Cloud Data SQL Guides Python Guides Tools Glossary Resources About Contact
Tool Categories

The Categories Behind Every Data Stack

Understand what each category of tool does before comparing specific products.

💾

Databases

Systems for storing and querying structured or unstructured data.

📊

Data Warehouses

Analytical stores optimized for large-scale reporting queries.

🔄

Orchestration

Schedulers that coordinate pipeline tasks and dependencies.

🧩

Transformation

Frameworks for modeling and transforming data with SQL.

Processing Engines

Distributed compute engines like Apache Spark for large datasets.

Cloud Storage

Durable object storage for data lakes and raw file archives.

🔐

Data Quality

Tools and frameworks for validating and monitoring data.

📈

BI & Visualization

Dashboards and reporting tools built on top of warehouses.