DataDriven
LearnPracticeInterviewDiscussDailyJobs

A daily orders pipeline runs as five bash scripts wired together by cron at 1:00am, 2:00am, 3:00am,

A medium Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.

Domain
Pipeline Design
Difficulty
medium

Problem

A daily orders pipeline runs as five bash scripts wired together by cron at 1:00am, 2:00am, 3:00am, 4:00am, and 5:00am. Each script implicitly assumes the previous one finished. The extract usually takes thirty minutes but sometimes runs ninety; on those nights the 2:00am clean job starts on time, sees no new output, and finishes successfully on stale data, so the morning dashboard shows numbers that look complete but are wrong. This section asks you to spot the failure, not fix it. Add ONE marker node to the canvas whose name calls out the broken boundary between the two specific cron-scheduled scripts where the silent stale-read first appears, and names the symptom (silent stale read, not a visible error). Do not add an orchestrator yet.

Practice This Problem

Solve this Pipeline Design problem with real code execution. DataDriven runs your solution and grades it automatically.

Related

  • All Practice Problems
  • Mock Interview Mode
  • System Design Interview Questions
  • Data Engineering Interview Prep Guide
  • Daily Challenge
  • Data Engineering Lessons