A daily orders pipeline runs as five bash scripts wired together by cron at 1:00am, 2:00am, 3:00am,
A medium Pipeline Design mock interview question on DataDriven. Practice with AI-powered feedback, real code execution, and a hire/no-hire decision.
- Domain
- Pipeline Design
- Difficulty
- medium
Interview Prompt
A daily orders pipeline runs as five bash scripts wired together by cron at 1:00am, 2:00am, 3:00am, 4:00am, and 5:00am. Each script implicitly assumes the previous one finished. The extract usually takes thirty minutes but sometimes runs ninety; on those nights the 2:00am clean job starts on time, sees no new output, and finishes successfully on stale data, so the morning dashboard shows numbers that look complete but are wrong. This section asks you to spot the failure, not fix it. Add ONE marker node to the canvas whose name calls out the broken boundary between the two specific cron-scheduled scripts where the silent stale-read first appears, and names the symptom (silent stale read, not a visible error). Do not add an orchestrator yet.
How This Interview Works
- Read the vague prompt (just like a real interview)
- Ask clarifying questions to the AI interviewer
- Write your pipeline design solution with real code execution
- Get instant feedback and a hire/no-hire decision