A regional bank closes the books at midnight and needs the leadership meeting at 8am to read the day
A medium Pipeline Design mock interview question on DataDriven. Practice with AI-powered feedback, real code execution, and a hire/no-hire decision.
- Domain: Pipeline Design
- Difficulty: medium
Interview Prompt
A regional bank closes its books at midnight, and leadership needs to read the day's transactions on a dashboard at the 8am meeting. The canvas already has the two endpoints of the nightly batch shape: a MySQL transactions source on the left and a Looker executive dashboard on the right. Apply the canonical nightly-batch run this section just walked through and add the missing middle:
- a raw-zone storage node (S3, GCS, or ADLS) where the nightly extract lands as a partitioned file
- a curated transform (dbt is the standard) that reads raw and writes the daily_balances table
- a warehouse (Snowflake, BigQuery, Redshift, or Databricks) that holds the curated table
- an orchestrator (Airflow, Dagster, or Prefect) that wakes the job once a day at 2am Pacific
Tag the curated warehouse table with a daily slaFreshness (< 24h) so the contract with the morning consumer is named. The pipeline runs once per day; nothing on this canvas runs continuously.
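The logic of the four middle nodes can be sketched in plain Python to make the contract concrete. This is a minimal illustration under stated assumptions, not the platform's reference solution: the `raw/transactions/dt=...` path layout, the `(account_id, amount_cents)` row shape, and every function name below are hypothetical, and datetimes are treated as naive wall-clock values in the bank's local time rather than time-zone-aware ones.

```python
from datetime import date, datetime, timedelta

# The "< 24h" freshness contract named in the prompt.
SLA_FRESHNESS = timedelta(hours=24)


def business_date_for_run(run_local: datetime) -> date:
    """Orchestrator step: the 2am run processes the day that closed at the
    preceding midnight. Assumes run_local is local wall-clock time."""
    return run_local.date() - timedelta(days=1)


def raw_partition_path(business_date: date) -> str:
    """Raw zone: one date-partitioned prefix per nightly extract.
    The path layout is a hypothetical convention, not a required one."""
    return f"raw/transactions/dt={business_date.isoformat()}/part-000.parquet"


def daily_balances(rows):
    """Curated transform stand-in: net amount per account for one day.
    Assumes each row is (account_id, amount_cents); a dbt model would do
    the equivalent aggregation in SQL inside the warehouse."""
    totals = {}
    for account_id, amount_cents in rows:
        totals[account_id] = totals.get(account_id, 0) + amount_cents
    return totals


def is_fresh(last_loaded_at: datetime, checked_at: datetime) -> bool:
    """Warehouse table contract: data must be less than 24h old when read."""
    return checked_at - last_loaded_at < SLA_FRESHNESS


if __name__ == "__main__":
    run_started = datetime(2024, 3, 5, 2, 0)        # 2am local run
    day = business_date_for_run(run_started)        # books closed at midnight
    print(raw_partition_path(day))
    print(daily_balances([("acct-1", 10_000), ("acct-2", 2_500), ("acct-1", -4_000)]))
    print(is_fresh(datetime(2024, 3, 5, 2, 10), datetime(2024, 3, 5, 8, 0)))
```

Partitioning the raw zone by business date keeps a rerun of one night idempotent: the job overwrites a single prefix instead of appending to a shared file. The freshness check is what the 8am consumer depends on. If the 2am run fails and the table still holds the previous night's load, the check returns false at meeting time.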
How This Interview Works
- Read the vague prompt (just like a real interview)
- Ask clarifying questions to the AI interviewer
- Write your pipeline design solution with real code execution
- Get instant feedback and a hire/no-hire decision