DataDriven
LearnPracticeInterviewDiscussDailyJobs

A rideshare company surfaces driver locations on a live ops map

A medium Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.

Domain
Pipeline Design
Difficulty
medium

Problem

A rideshare company surfaces driver locations on a live ops map. Driver phones publish GPS pings to a Kinesis stream every 3 seconds, and an ops dashboard polls the latest location per driver every second. The canvas has the Kinesis source; the rest of the canonical streaming shape this section just walked through is missing. Trace one event from the source through the pipeline and add: a streaming consumer process (Flink, Spark Structured Streaming, Kafka Streams, or Beam) that reads events as they arrive (plain Spark and dbt are batch tools and do not satisfy the streaming engine role), a serving store the dashboard can poll for the latest location per driver, and the live ops dashboard consumer. Tag every downstream node with a real-time-tier slaFreshness (real-time or < 1min) so the diagram makes the rhythm visible. Nothing on this canvas runs on a schedule.

Practice This Problem

Solve this Pipeline Design problem with real code execution. DataDriven runs your solution and grades it automatically.

Related

  • All Practice Problems
  • Mock Interview Mode
  • System Design Interview Questions
  • Data Engineering Interview Prep Guide
  • Daily Challenge
  • Data Engineering Lessons