Staff Data Engineer Enterprise Architecture
CortevaThe Patients We Cannot Move
We run federated machine learning across hospital networks for clinical trial research. Each hospital has patient data we're not allowed to move - privacy law and patient consent don't permit central aggregation. We need to train models and compute population statistics across data that is physically distributed across 40 hospitals in 8 countries, each with different EHR systems and data formats. Design a data pipeline that makes this possible.
Ask the interviewer clarifying questions to understand the requirements and constraints before designing.
When you're ready, click Ready to Design to start building.
The Patients We Cannot Move
A hard Pipeline Design mock interview question on DataDriven. Practice with AI-powered feedback, real code execution, and a hire/no-hire decision.
- Domain
- Pipeline Design
- Difficulty
- hard
- Seniority
- staff
Interview Prompt
We run federated machine learning across hospital networks for clinical trial research. Each hospital has patient data we're not allowed to move - privacy law and patient consent don't permit central aggregation. We need to train models and compute population statistics across data that is physically distributed across 40 hospitals in 8 countries, each with different EHR systems and data formats. Design a data pipeline that makes this possible.
How This Interview Works
- Read the vague prompt (just like a real interview)
- Ask clarifying questions to the AI interviewer
- Write your pipeline design solution with real code execution
- Get instant feedback and a hire/no-hire decision