DataDriven
LearnPracticeInterviewDiscussDailyJobs

Every Format Imaginable

A hard Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.

Domain
Pipeline Design
Difficulty
hard
Seniority
L5

Problem

We're a health data company that aggregates records from hospitals, clinics, and labs. The data comes in every format imaginable: structured claims data, semi-structured HL7 messages, PDF lab reports, and free-text clinical notes. We need all of it in one place where analysts can query it. Design the data lake pipeline.

Summary

PDFs, HL7, JSON. All of it lands in the same lake.

Practice This Problem

Solve this Pipeline Design problem with real code execution. DataDriven runs your solution and grades it automatically.

Related

  • All Practice Problems
  • Mock Interview Mode
  • System Design Interview Questions
  • Data Engineering Interview Prep Guide
  • Daily Challenge
  • Data Engineering Lessons