
Hands-On Tutorials to Implement Concepts

The best engineers have a strong appreciation for the pros and cons of a solution. 

Each exercise below trains the reflection muscle to further connect the dots.

Data Flow

  • DF001: ETL vs. ELTL: Compared single-step ETL vs. multi-stage ELTL for tradeoffs in simplicity, scalability, and raw data retention.

  • DF002: Scheduling & Pipelines: Built orchestrated Dataflows with advanced scheduling & dependency control, optimizing refresh strategies.

  • DF003: Data Type Conversion: Standardized inconsistent CSV datatypes & generated optimized Warehouse tables, assessing schema tradeoffs.

  • DF004: Data Cleansing: Transformed a messy audiobook dataset into an analyst-ready model, balancing complexity with performance.
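
To make the DF001 tradeoff concrete, here is a minimal sketch in Python using an in-memory SQLite database. It is not the tutorial's actual Dataflow implementation: the sample rows, the table names (etl_clean, eltl_raw, eltl_clean), and the parse_price helper are all hypothetical, chosen only to show that ELTL keeps a raw copy while single-step ETL does not.

```python
import sqlite3

# Hypothetical raw rows, as they might arrive from a CSV export (all strings).
RAW_ROWS = [("1", " 12.50 "), ("2", "n/a"), ("3", "7")]


def parse_price(text):
    """Return the price as a float, or None if the text is not numeric."""
    try:
        return float(text.strip())
    except ValueError:
        return None


def run_etl(conn):
    # ETL: transform in flight and load only the cleaned result.
    # The original strings are not stored anywhere.
    conn.execute("CREATE TABLE etl_clean (id INTEGER, price REAL)")
    cleaned = [(int(i), parse_price(p)) for i, p in RAW_ROWS]
    conn.executemany("INSERT INTO etl_clean VALUES (?, ?)", cleaned)


def run_eltl(conn):
    # ELTL, stage 1: land the raw rows untouched.
    conn.execute("CREATE TABLE eltl_raw (id TEXT, price TEXT)")
    conn.executemany("INSERT INTO eltl_raw VALUES (?, ?)", RAW_ROWS)
    # ELTL, stage 2: transform from the retained raw table, then load again.
    conn.execute("CREATE TABLE eltl_clean (id INTEGER, price REAL)")
    raw = conn.execute("SELECT id, price FROM eltl_raw").fetchall()
    cleaned = [(int(i), parse_price(p)) for i, p in raw]
    conn.executemany("INSERT INTO eltl_clean VALUES (?, ?)", cleaned)


conn = sqlite3.connect(":memory:")
run_etl(conn)
run_eltl(conn)
```

Both paths end with the same cleaned table, but only the ELTL path can still answer what the unparseable value originally was, at the cost of an extra table and an extra load step.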
