Stage 3: Data and Feature Engineering

A free 6-week data engineering bootcamp by Data with Zach, covering enterprise-level data engineering practices such as data modeling, big data processing, and real-time stream processing.

DataEngineeringApacheSparkBigDataYouTubeVideoFreeEnglish

Data Engineering Boot Camp Course Introduction

Course Overview

  • Instructor: Data with Zach
  • Platform: YouTube
  • Course Type: Free Data Engineering Boot Camp
  • Total Views: Over 2.34 million views
  • Number of Videos: 21 videos

Course Content Structure

Week 1 - Data Modeling Fundamentals

  1. 6-week Free Data Engineering Boot Camp Launch Video
  2. 6-week Data Engineering Boot Camp Kick off and Informational video
  3. Data Modeling - Complex Data Types and Cumulation - Day 1 Lecture (43:17)
  4. Data Modeling - Cumulative Dimensions, Struct and Array - Day 1 Lab (41:17)
  5. Data Modeling - Slowly Changing Dimensions and Idempotency - Day 2 Lecture (40:26)
  6. Data Modeling - Building Slowly Changing Dimensions (SCDs) - Day 2 Lab (45:39)
  7. Data Modeling - Graph Databases & Additive Dimensions - Day 3 Lecture (34:08)
  8. Data Modeling - Building an NBA Player Network Graph - Day 3 Lab (42:15)

Week 2 - Big Data Processing

  1. How Meta Models Big Volume Event Data - Full 4 Hour Course (3:51:54)
  2. Spark + Iceberg in 1 Hour - Memory Tuning, Joins, Partition - Week 3 Day 1 (1:15:02)

Week 3 - High-Performance Data Processing and Testing

  1. High Performance Spark in 1 hour - DataFrame, Dataset, UDFs, Caching - Week 3 Day 2 (1:10:17)
  2. Testing Apache Spark Jobs in CI/CD - Week 3 Day 3 (1:08:59)
  3. Build a Gold Pipeline like Airbnb MIDAS Process - Week 3 Day 1 Analytics (1:19:47)
  4. Master Data Contracts in 25 minutes! - Week 3 Day 2 Analytics (27:36)
  5. Master Real-time Data Pipelines with Kafka and Flink - 3 hr Course (2:31:36)

Week 4 - Enterprise Data Engineering Patterns

  1. Data Engineer Design Patterns at Meta - Growth Accounting - Week 4 Day 1 (1:27:15)
  2. Data Engineering Design Patterns at Meta - Funnel Analysis - Week 4 Day 1 (1:18:19)
  3. Job-Ready Capstone Projects for Analytics Engineering (32:50)

Week 5 - Product Thinking and KPIs

  1. Data Engineering like a Product Manager - KPIs & Experiments - Week 5 (2:37:06)

Week 6 - Data Pipeline Maintenance and Visualization

  1. Maintain Data Pipelines Like Netflix and Airbnb - Week 6 (2:24:43)
  2. Build Data-Driven Business Value with Tableau Viz - Week 6 (1:49:41)

Course Features

Technology Stack Coverage

  • Apache Spark: Memory tuning, join operations, partitioning strategies
  • Apache Kafka: Real-time data stream processing
  • Apache Flink: Stream processing framework
  • Iceberg: Data lake table format
  • NBA Data: Graph database practice
  • Tableau: Data visualization

Enterprise Practices

  • Data engineering design patterns at Meta
  • Data pipeline maintenance strategies at Netflix and Airbnb
  • Large-scale event data modeling
  • Data contract management

Project-Driven Learning

  • Building an NBA player network graph
  • Implementing the Airbnb MIDAS process
  • End-to-end data pipeline development
  • Real-time stream processing projects

Target Audience

  • Data engineering beginners
  • Developers looking to enhance big data processing skills
  • Engineers interested in learning enterprise-level data architecture
  • Technical professionals interested in real-time data processing

Learning Outcomes

  • Master modern data engineering technology stack
  • Understand enterprise-level data processing best practices
  • Gain the ability to build scalable data pipelines
  • Learn data modeling and architecture design thinking