
Put it all together: design storage for a high-volume event lake that must avoid the small-file problem

A medium Pipeline Design mock interview question on DataDriven. Practice with AI-powered feedback, real code execution, and a hire/no-hire decision.

Domain: Pipeline Design
Difficulty: medium

Interview Prompt

Put it all together: design storage for a high-volume event lake that must avoid the small-file problem, push predicates to scan less, tier old data to cheap storage, and evolve its schema safely as new fields appear.
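
As a starting point for thinking about the prompt, here is a minimal sketch of three of the four concerns: compacting small files, pruning partitions by a date predicate, and tiering old partitions. It is not a reference answer. The `DataFile` type, the function names, the 128 MiB target file size, and the 90-day cold threshold are all assumptions made for illustration; none of them come from the prompt. Schema evolution is left out.

```python
from dataclasses import dataclass
from datetime import date

# Assumed values for illustration; the prompt specifies neither.
TARGET_FILE_BYTES = 128 * 1024 * 1024
COLD_AFTER_DAYS = 90


@dataclass(frozen=True)
class DataFile:
    """Hypothetical descriptor for one file in a date-partitioned lake."""
    path: str
    partition_date: date
    size_bytes: int


def plan_compaction(files, target=TARGET_FILE_BYTES):
    """Group small files in each partition into batches of at most `target` bytes.

    Files already at or above the target are left alone, and a batch is only
    emitted when it merges at least two files.
    """
    by_partition = {}
    for f in files:
        if f.size_bytes < target:
            by_partition.setdefault(f.partition_date, []).append(f)

    batches = []
    for part in sorted(by_partition):
        current, current_size = [], 0
        for f in sorted(by_partition[part], key=lambda x: x.size_bytes, reverse=True):
            if current and current_size + f.size_bytes > target:
                if len(current) > 1:
                    batches.append(current)
                current, current_size = [], 0
            current.append(f)
            current_size += f.size_bytes
        if len(current) > 1:
            batches.append(current)
    return batches


def prune_partitions(files, start, end):
    """Keep only files whose partition date lies in [start, end].

    This mimics partition pruning: a date predicate is answered from
    partition metadata alone, without opening the skipped files.
    """
    return [f for f in files if start <= f.partition_date <= end]


def storage_tier(partition_date, today, cold_after_days=COLD_AFTER_DAYS):
    """Return "cold" for partitions older than the threshold, else "hot"."""
    return "cold" if (today - partition_date).days > cold_after_days else "hot"
```

In a real design these decisions usually live in a table format or catalog rather than in hand-written code. The sketch only makes the trade-offs concrete: fewer, larger files per partition, scans limited to the partitions a query needs, and an age-based rule for moving data to cheaper storage.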

How This Interview Works

  1. Read the vague prompt (just like a real interview)
  2. Ask clarifying questions to the AI interviewer
  3. Write your pipeline design solution with real code execution
  4. Get instant feedback and a hire/no-hire decision
