A platform team has two transforms wired backwards
A medium Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.
- Domain
- Pipeline Design
- Difficulty
- medium
Problem
A platform team has two transforms wired backwards. A daily 12-table revenue join runs on a $4K/month Spark cluster pulling data out of Snowflake and writing it back; a PyTorch image-feature extraction runs as a dbt SQL UDF that silently writes nulls. Apply the section's ETL-vs-ELT rule and swap the two transforms so each runs on the right tool.
Practice This Problem
Solve this Pipeline Design problem with real code execution. DataDriven runs your solution and grades it automatically.