Data Engineering

Data Pipelines

Batch Processing vs. Stream Processing

Common Concerns

Stewardship

  • Data Governance
  • Data Privacy
  • Data Security
  • Data Lineage

Organization

  • Data Versioning
  • Data Catalog
  • Data Enrichment
  • Data Aggregation
  • Data Partitioning
  • Data Locality
  • Data Transformation
  • Data Normalization and Denormalization
  • Data Deduplication

Quality and Performance

  • Data Quality
  • Data Serialization
  • Data Compression
  • Data Validation
  • Data Monitoring
  • Data Profiling

Data Processing Workloads

ETL

ELT

Data Warehousing

Data Lakes, Lakehouses, Fabrics, and Reservoirs

Data Marts

Data Hubs

Data Catalogs