A daily orders pipeline runs as five bash scripts wired together by cron at 1:00am, 2:00am, 3:00am,
A medium Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.
- Domain
- Pipeline Design
- Difficulty
- medium
Problem
A daily orders pipeline runs as five bash scripts wired together by cron at 1:00am, 2:00am, 3:00am, 4:00am, and 5:00am. Each script implicitly assumes the previous one finished. The extract usually takes thirty minutes but sometimes runs ninety; on those nights the 2:00am clean job starts on time, sees no new output, and finishes successfully on stale data, so the morning dashboard shows numbers that look complete but are wrong. This section asks you to spot the failure, not fix it. Add ONE marker node to the canvas whose name calls out the broken boundary between the two specific cron-scheduled scripts where the silent stale-read first appears, and names the symptom (silent stale read, not a visible error). Do not add an orchestrator yet.
Practice This Problem
Solve this Pipeline Design problem with real code execution. DataDriven runs your solution and grades it automatically.