Loading interview...

Regulatory Data ETL Pipeline with Dynamic Schema Handling

A medium Pipeline Design mock interview question on DataDriven. Practice with AI-powered feedback, real code execution, and a hire/no-hire decision.

Domain
Pipeline Design
Difficulty
medium
Seniority
senior

Interview Prompt

We receive transaction reporting data from tens of thousands of regulated firms under MiFID II, and every firm formats their submission slightly differently. We need a pipeline that can ingest these files, normalize them to a canonical schema, validate them against regulatory rules, and produce an immutable record that auditors can query years later. Build it end-to-end and explain how you handle files whose structure changes without advance notice.

How This Interview Works

  1. Read the vague prompt (just like a real interview)
  2. Ask clarifying questions to the AI interviewer
  3. Write your pipeline design solution with real code execution
  4. Get instant feedback and a hire/no-hire decision