Database Replication and Schema Normalization Pipeline

A medium-difficulty Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.

Domain: Pipeline Design
Difficulty: medium
Seniority: senior

Problem

Our product runs on a transactional database that the analytics team cannot query directly. We need to replicate it to a separate analytics system where they can run reports. The source data is messy and split across several tables that need to be normalized and combined before it is useful. Design a pipeline to replicate and prepare this data.
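
One possible shape for such a pipeline is sketched below. It is only an illustration under stated assumptions, not a reference solution. The two source tables (`users`, `orders`), their columns, the `updated_at` change marker, and the `order_facts` target table are all hypothetical, since the problem names no schema. SQLite stands in for both the transactional database and the analytics system so the sketch runs with the Python standard library alone. The technique shown is incremental extraction by watermark, followed by a join, light cleaning, and an idempotent upsert into the analytics side.

```python
import sqlite3


def setup_source(conn):
    """Create a small hypothetical source schema split across two tables."""
    conn.executescript("""
        CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT, updated_at INTEGER);
        CREATE TABLE orders (id INTEGER PRIMARY KEY, user_id INTEGER,
                             amount_cents INTEGER, updated_at INTEGER);
    """)


def replicate(source, target, watermark):
    """Copy orders changed after `watermark` into the analytics table.

    Rows are joined with user data, cleaned, and upserted by primary key,
    so re-running with the same watermark does not create duplicates.
    Returns the new watermark (the highest updated_at value seen).
    """
    target.execute("""
        CREATE TABLE IF NOT EXISTS order_facts (
            order_id   INTEGER PRIMARY KEY,
            user_email TEXT,
            amount     REAL,
            updated_at INTEGER
        )
    """)
    # Extract: only rows that changed since the last successful run.
    rows = source.execute("""
        SELECT o.id, u.email, o.amount_cents, o.updated_at
        FROM orders o
        LEFT JOIN users u ON u.id = o.user_id
        WHERE o.updated_at > ?
        ORDER BY o.updated_at
    """, (watermark,)).fetchall()

    for order_id, email, cents, updated_at in rows:
        # Transform: trim and lowercase emails, convert cents to currency units.
        clean_email = email.strip().lower() if email else None
        amount = cents / 100.0 if cents is not None else None
        # Load: replace any existing row with the same order_id.
        target.execute(
            "INSERT OR REPLACE INTO order_facts VALUES (?, ?, ?, ?)",
            (order_id, clean_email, amount, updated_at),
        )
        watermark = max(watermark, updated_at)

    target.commit()
    return watermark
```

A caller would open two connections, call `replicate(source, target, 0)` for the initial load, store the returned watermark, and pass it back in on each later run. This sketch deliberately leaves gaps that a fuller design would need to address: it does not detect deleted rows, and a change to a user's email is not picked up unless the related order also changes. Log-based change data capture is a common alternative when those cases matter.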