DataDriven
LearnPracticeInterviewDiscussDailyJobs

A live metrics platform serves a real-time consumer dashboard from a streaming aggregation while a p

A medium Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.

Domain
Pipeline Design
Difficulty
medium

Problem

A live metrics platform serves a real-time consumer dashboard from a streaming aggregation while a producer team prepares a breaking schema change to the underlying event topic. The capstone composes the four intermediate-tier moves: a schema registry that rejects incompatible producer schemas at registration, an expand-contract phase plan to roll out the breaking change, an event_time-bucketed aggregation feeding the consumer, and a watermark plus allowed-lateness budget plus dead-letter sink so late events are caught or quarantined. Add a schema registry between producer and topic with its compatibility mode named, mark the four expand-contract phases on the canvas, route the aggregation by event_time with a named watermark strategy and 60-minute allowed-lateness budget, and add a dead-letter sink for events past the budget.

Practice This Problem

Solve this Pipeline Design problem with real code execution. DataDriven runs your solution and grades it automatically.

Related

  • All Practice Problems
  • Mock Interview Mode
  • System Design Interview Questions
  • Data Engineering Interview Prep Guide
  • Daily Challenge
  • Data Engineering Lessons