DataDriven
LearnPracticeInterviewDiscussDailyJobs

A daily Python ETL pulls 250,000 orders and INSERTs them into a plain Snowflake table

A medium Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.

Domain
Pipeline Design
Difficulty
medium

Problem

A daily Python ETL pulls 250,000 orders and INSERTs them into a plain Snowflake table. Last Tuesday the job crashed at row 180,001; on retry it appended all 250,000 again, leaving 430,000 rows and inflating revenue 41 percent. The section names this the append-plus-retry antipattern. Apply the structural fix: change how the destination writes so retries produce identical state.

Practice This Problem

Solve this Pipeline Design problem with real code execution. DataDriven runs your solution and grades it automatically.

Related

  • All Practice Problems
  • Mock Interview Mode
  • System Design Interview Questions
  • Data Engineering Interview Prep Guide
  • Daily Challenge
  • Data Engineering Lessons