The Analyst Who Saw the Salary Data
A hard Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.
- Domain
- Pipeline Design
- Difficulty
- hard
- Seniority
- L5
Problem
We operate a multi-tenant data lake used by five different business units, each with different data sensitivity levels and compliance requirements. Currently all data is stored in a flat structure with a single shared reader role. We have had two incidents where engineers accessed data from a different business unit. Design a file-level access control architecture for the data lake.
Summary
Two incidents. One shared lake. The access model was never designed, just assumed.
Practice This Problem
Solve this Pipeline Design problem with real code execution. DataDriven runs your solution and grades it automatically.