DataDriven
LearnPracticeInterviewDiscussDailyJobs

Three environments serve three different validation goals

A medium Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.

Domain
Pipeline Design
Difficulty
medium

Problem

Three environments serve three different validation goals. Dev catches typos and broken refs; CI catches schema regressions and unit-test failures; staging catches integration issues and scale problems that slim CI does not. The section's data-shape choice per environment: dev gets sample data committed to the repo (tens to thousands of rows); CI gets a slim-CI subset of recent prod (modified models plus descendants); staging gets a subset of prod (last 7 days) or masked full prod with PII columns hashed. Choose the env data shape by adding three environment storage nodes downstream of a masking pipeline whose name states the data shape and PII handling for each.

Practice This Problem

Solve this Pipeline Design problem with real code execution. DataDriven runs your solution and grades it automatically.

Related

  • All Practice Problems
  • Mock Interview Mode
  • System Design Interview Questions
  • Data Engineering Interview Prep Guide
  • Daily Challenge
  • Data Engineering Lessons