DataDriven
LearnPracticeInterviewDiscussDailyJobs

Put it all together: design storage for a high-volume event lake that must avoid the small-file prob

A medium Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.

Domain
Pipeline Design
Difficulty
medium

Problem

Put it all together: design storage for a high-volume event lake that must avoid the small-file problem, push predicates to scan less, tier old data to cheap storage, and evolve its schema safely as new fields appear.

Practice This Problem

Solve this Pipeline Design problem with real code execution. DataDriven runs your solution and grades it automatically.

Related

  • All Practice Problems
  • Mock Interview Mode
  • System Design Interview Questions
  • Data Engineering Interview Prep Guide
  • Daily Challenge
  • Data Engineering Lessons