
Six Million Rows Before the Market Opens

A medium-difficulty Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.

Domain: Pipeline Design
Difficulty: medium
Seniority: L5

Problem

A third-party vendor drops a CSV file every morning containing 6 million rows of time-series financial data. Our quant team needs this data queryable in the warehouse before the market opens. Design a full ETL architecture to ingest this file.
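
One possible starting point, offered as a minimal sketch rather than a reference answer: stream the file in fixed-size batches, validate each row, load the valid rows into a staging table, and publish them to the queryable table in a single transaction so that a rerun for the same day replaces that day's rows instead of duplicating them. Everything specific here is an assumption made for illustration: the column names (`symbol`, `ts`, `price`), the table names, the `BATCH_SIZE` value, and the use of `sqlite3` as a stand-in for the warehouse.

```python
import csv
import io
import sqlite3
from datetime import datetime
from itertools import islice

BATCH_SIZE = 50_000  # assumed value; tune to memory and warehouse limits


def parse_row(row):
    """Validate one CSV row; return a tuple, or None if the row is malformed."""
    try:
        ts = datetime.fromisoformat(row["ts"])
        price = float(row["price"])
    except (KeyError, TypeError, ValueError):
        return None
    symbol = (row.get("symbol") or "").strip()
    if not symbol:
        return None
    return (symbol, ts.isoformat(), price)


def load_file(conn, lines, trade_date):
    """Stream rows into staging, then publish them in one transaction."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS prices "
        "(trade_date TEXT, symbol TEXT, ts TEXT, price REAL)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS prices_staging "
        "(symbol TEXT, ts TEXT, price REAL)"
    )
    conn.execute("DELETE FROM prices_staging")  # start from an empty staging table

    reader = csv.DictReader(lines)
    loaded = rejected = 0
    while True:
        # Read at most BATCH_SIZE rows so the whole file never sits in memory.
        batch = list(islice(reader, BATCH_SIZE))
        if not batch:
            break
        good = [r for r in map(parse_row, batch) if r is not None]
        rejected += len(batch) - len(good)
        conn.executemany("INSERT INTO prices_staging VALUES (?, ?, ?)", good)
        loaded += len(good)
    conn.commit()

    # Publish step: delete-then-insert for the day makes reruns idempotent.
    with conn:
        conn.execute("DELETE FROM prices WHERE trade_date = ?", (trade_date,))
        conn.execute(
            "INSERT INTO prices SELECT ?, symbol, ts, price FROM prices_staging",
            (trade_date,),
        )
    return loaded, rejected


if __name__ == "__main__":
    # Tiny in-memory sample standing in for the vendor file.
    sample = io.StringIO(
        "symbol,ts,price\n"
        "AAPL,2024-01-02T09:30:00,185.5\n"
        "AAPL,bad-timestamp,185.6\n"
    )
    conn = sqlite3.connect(":memory:")
    print(load_file(conn, sample, "2024-01-02"))
```

The returned `(loaded, rejected)` counts give a simple data-quality signal: a production design would likely compare them against the expected row count and alert before the market opens if too many rows are rejected. A real warehouse would also typically use its bulk-load path rather than row inserts; the staging-then-publish shape is the part this sketch is meant to show.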

Summary

One massive CSV. Millions of timestamps.
