Stage 3: Data and Feature Engineering
A free 6-week data engineering boot camp by Data with Zach that covers enterprise-level data engineering practices, including data modeling, big data processing, and real-time stream processing.
Data Engineering Boot Camp Course Introduction
Course Overview
- Instructor: Data with Zach
- Platform: YouTube
- Course Type: Free Data Engineering Boot Camp
- Total Views: Over 2.34 million
- Number of Videos: 21
Course Content Structure
Week 1 - Data Modeling Fundamentals
- 6-week Free Data Engineering Boot Camp Launch Video
- 6-week Data Engineering Boot Camp Kick off and Informational video
- Data Modeling - Complex Data Types and Cumulation - Day 1 Lecture (43:17)
- Data Modeling - Cumulative Dimensions, Struct and Array - Day 1 Lab (41:17)
- Data Modeling - Slowly Changing Dimensions and Idempotency - Day 2 Lecture (40:26)
- Data Modeling - Building Slowly Changing Dimensions (SCDs) - Day 2 Lab (45:39)
- Data Modeling - Graph Databases & Additive Dimensions - Day 3 Lecture (34:08)
- Data Modeling - Building an NBA Player Network Graph - Day 3 Lab (42:15)
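To make the Day 2 topics (slowly changing dimensions and idempotency) more concrete, here is a minimal sketch that is not taken from the course materials. It assumes a Type 2 style table rebuilt from per-season snapshots. The names `ScdRow` and `build_scd` and the sample player values are hypothetical. Because the table is recomputed from the full snapshot history rather than appended to, running the function again on the same input (or on input containing duplicates) produces the same rows, which is one way to get idempotent behavior.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class ScdRow:
    # One row per unbroken streak during which `value` did not change.
    key: str
    value: str
    start_season: int
    end_season: int


def build_scd(snapshots: Iterable[tuple[str, int, str]]) -> list[ScdRow]:
    """Collapse (key, season, value) snapshots into streak rows.

    Assumes at most one value per (key, season). Duplicated snapshots
    are dropped by set() so reruns cannot double-count.
    """
    rows: list[ScdRow] = []
    for key, season, value in sorted(set(snapshots)):
        last = rows[-1] if rows else None
        extends_streak = (
            last is not None
            and last.key == key
            and last.value == value
            and last.end_season + 1 == season
        )
        if extends_streak:
            # Same value in the next consecutive season: widen the range.
            rows[-1] = ScdRow(key, value, last.start_season, season)
        else:
            # New key, changed value, or a gap: open a new row.
            rows.append(ScdRow(key, value, season, season))
    return rows


snapshots = [
    ("player_a", 2001, "star"),
    ("player_a", 2002, "star"),
    ("player_a", 2003, "average"),
    ("player_b", 2002, "average"),
]
scd = build_scd(snapshots)
```

Here `scd` holds three rows: `player_a` as "star" for 2001 to 2002, `player_a` as "average" for 2003, and `player_b` as "average" for 2002.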
Week 2 - Big Data Processing
- How Meta Models Big Volume Event Data - Full 4 Hour Course (3:51:54)
- Spark + Iceberg in 1 Hour - Memory Tuning, Joins, Partition - Week 3 Day 1 (1:15:02)
Week 3 - High-Performance Data Processing and Testing
- High Performance Spark in 1 hour - DataFrame, Dataset, UDFs, Caching - Week 3 Day 2 (1:10:17)
- Testing Apache Spark Jobs in CI/CD - Week 3 Day 3 (1:08:59)
- Build a Gold Pipeline like Airbnb MIDAS Process - Week 3 Day 1 Analytics (1:19:47)
- Master Data Contracts in 25 minutes! - Week 3 Day 2 Analytics (27:36)
- Master Real-time Data Pipelines with Kafka and Flink - 3 hr Course (2:31:36)
Week 4 - Enterprise Data Engineering Patterns
- Data Engineer Design Patterns at Meta - Growth Accounting - Week 4 Day 1 (1:27:15)
- Data Engineering Design Patterns at Meta - Funnel Analysis - Week 4 Day 1 (1:18:19)
- Job-Ready Capstone Projects for Analytics Engineering (32:50)
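For readers unfamiliar with the growth accounting pattern named in the Week 4 titles, the following is a rough sketch of one common formulation, not the course's own code. It assumes three sets of user IDs: everyone active at any time before the current period, everyone active in the previous period, and everyone active in the current period. The function name `growth_accounting` and the state labels are illustrative assumptions.

```python
from __future__ import annotations


def growth_accounting(
    ever_active: set[str],  # active at any time before the current period
    previous: set[str],     # active in the previous period (subset of ever_active)
    current: set[str],      # active in the current period
) -> dict[str, set[str]]:
    """Split users into growth-accounting states for the current period."""
    return {
        "new": current - ever_active,                       # first activity ever
        "retained": current & previous,                     # active then and now
        "resurrected": (current & ever_active) - previous,  # returned after a gap
        "churned": previous - current,                      # active then, not now
        "stale": ever_active - previous - current,          # inactive in both
    }


states = growth_accounting(
    ever_active={"a", "b", "c", "d"},
    previous={"a", "b"},
    current={"a", "c", "e"},
)
```

A useful sanity check on this breakdown is that the new, retained, and resurrected users together equal the set of users active in the current period.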
Week 5 - Product Thinking and KPIs
- Data Engineering like a Product Manager - KPIs & Experiments - Week 5 (2:37:06)
Week 6 - Data Pipeline Maintenance and Visualization
- Maintain Data Pipelines Like Netflix and Airbnb - Week 6 (2:24:43)
- Build Data-Driven Business Value with Tableau Viz - Week 6 (1:49:41)
Course Features
Technology Stack Coverage
- Apache Spark: Memory tuning, join operations, partitioning strategies
- Apache Kafka: Real-time data stream processing
- Apache Flink: Stream processing framework
- Apache Iceberg: Data lake table format
- NBA data: Practice dataset for graph data modeling
- Tableau: Data visualization
Enterprise Practices
- Data engineering design patterns at Meta
- Data pipeline maintenance strategies at Netflix and Airbnb
- Large-scale event data modeling
- Data contract management
Project-Driven Learning
- Building an NBA player network graph
- Implementing the Airbnb MIDAS process
- End-to-end data pipeline development
- Real-time stream processing projects
Target Audience
- Data engineering beginners
- Developers looking to enhance big data processing skills
- Engineers interested in learning enterprise-level data architecture
- Technical professionals interested in real-time data processing
Learning Outcomes
- Master the modern data engineering technology stack
- Understand enterprise-level data processing best practices
- Gain the ability to build scalable data pipelines
- Learn data modeling and architecture design thinking