A daily Python ETL pulls 250,000 orders and INSERTs them into a plain Snowflake table
A medium Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.
- Domain: Pipeline Design
- Difficulty: medium
Problem
A daily Python ETL pulls 250,000 orders and INSERTs them into a plain Snowflake table. Last Tuesday the job crashed at row 180,001; on retry it appended all 250,000 rows again, leaving 430,000 rows in the table and inflating revenue by 41 percent. The section names this the append-plus-retry antipattern. Apply the structural fix: change how the destination is written so that a retry, or an accidental second run, produces the same final state as a single clean run.
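One way to picture the structural fix is a load that replaces the day's slice inside a single transaction instead of appending to it. The sketch below is a minimal illustration, not the graded solution: it uses Python's built-in `sqlite3` as a stand-in for Snowflake, and the column names (`order_id`, `load_date`, `amount`), the `load_day` helper, the `crash_at` test hook, the sample date, and the scaled-down batch of 250 rows are all assumptions made for the example. In Snowflake the same idea would typically be expressed as a transactional delete-and-insert of the day's rows or a `MERGE` keyed on the order identifier.

```python
import sqlite3

# Hypothetical schema: order_id, load_date, and amount are illustrative names.
DDL = """
CREATE TABLE IF NOT EXISTS orders (
    order_id  INTEGER PRIMARY KEY,
    load_date TEXT NOT NULL,
    amount    REAL NOT NULL
)
"""


def load_day(conn, load_date, rows, crash_at=None):
    """Replace one day's slice atomically so a retry yields the same final state.

    crash_at is a test hook: it simulates a failure just before inserting
    the row at that index.
    """
    # `with conn` commits if the block succeeds and rolls back if it raises,
    # so a crash leaves the table exactly as it was before this attempt.
    with conn:
        conn.execute("DELETE FROM orders WHERE load_date = ?", (load_date,))
        for i, (order_id, amount) in enumerate(rows):
            if crash_at is not None and i == crash_at:
                raise RuntimeError("simulated crash mid-load")
            conn.execute(
                "INSERT INTO orders (order_id, load_date, amount) VALUES (?, ?, ?)",
                (order_id, load_date, amount),
            )


def row_count(conn):
    """Return the total number of rows currently in the orders table."""
    return conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]


conn = sqlite3.connect(":memory:")
conn.execute(DDL)

# Scaled-down stand-in for the 250,000-order batch.
batch = [(order_id, 10.0) for order_id in range(1, 251)]

try:
    load_day(conn, "2024-01-02", batch, crash_at=180)  # fails after 180 inserts
except RuntimeError:
    pass
count_after_crash = row_count(conn)  # partial inserts were rolled back

load_day(conn, "2024-01-02", batch)  # the retry
count_after_retry = row_count(conn)

load_day(conn, "2024-01-02", batch)  # an accidental second run
count_after_rerun = row_count(conn)

print(count_after_crash, count_after_retry, count_after_rerun)
```

In this sketch the crashed attempt leaves no rows behind, and both the retry and the extra run end with exactly 250 rows, because each run first clears the slice it is about to write. The design choice is that the load is scoped to a key the job owns (here the load date), so running it any number of times converges on the same table contents.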