Six Million Rows Before the Market Opens
A medium Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.
- Domain: Pipeline Design
- Difficulty: medium
- Seniority: L5
Problem
A third-party vendor drops a CSV file every morning containing 6 million rows of time-series financial data. Our quant team needs this data queryable in the warehouse before the market opens. Design a full ETL architecture to ingest this file.
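One possible shape for the load step is sketched below. It is a minimal illustration, not a reference solution. It assumes the file has `ts`, `symbol`, and `price` columns (the problem does not specify the schema), and it uses SQLite from the Python standard library as a stand-in for the warehouse. The table names `ticks` and `ticks_staging` and the functions `parse_row` and `ingest` are hypothetical. The sketch streams the file in fixed-size chunks so memory stays bounded, sets bad rows aside instead of failing the whole load, and publishes the data only after every chunk has been staged.

```python
import csv
import io
import sqlite3
from datetime import datetime
from itertools import islice


def parse_row(row):
    """Validate one raw CSV row; return a typed tuple or raise an error."""
    ts = datetime.fromisoformat(row["ts"])  # ValueError on a malformed timestamp
    price = float(row["price"])             # ValueError on a malformed number
    if not row["symbol"]:
        raise ValueError("missing symbol")
    return (ts.isoformat(), row["symbol"], price)


def ingest(conn, csv_file, chunk_size=50_000):
    """Stream csv_file into a staging table in chunks, then publish it.

    Returns (rows_loaded, rejected), where rejected is a list of
    (raw_row, reason) pairs for rows that failed validation.
    """
    # Assumed schema; a real vendor feed would dictate the columns.
    conn.execute("DROP TABLE IF EXISTS ticks_staging")
    conn.execute("CREATE TABLE ticks_staging (ts TEXT, symbol TEXT, price REAL)")

    reader = csv.DictReader(csv_file)
    loaded, rejected = 0, []
    while True:
        # Pull at most chunk_size rows so the full file is never held in memory.
        chunk = list(islice(reader, chunk_size))
        if not chunk:
            break
        good = []
        for row in chunk:
            try:
                good.append(parse_row(row))
            except (ValueError, KeyError, TypeError) as exc:
                # Keep bad rows for later inspection instead of aborting the load.
                rejected.append((row, str(exc)))
        conn.executemany("INSERT INTO ticks_staging VALUES (?, ?, ?)", good)
        loaded += len(good)

    # Publish step: replace the queryable table only after staging succeeded,
    # so readers never see a half-loaded file.
    conn.execute("DROP TABLE IF EXISTS ticks")
    conn.execute("ALTER TABLE ticks_staging RENAME TO ticks")
    conn.commit()
    return loaded, rejected
```

Calling `ingest(sqlite3.connect(":memory:"), open_file)` on any text file object returns the count of loaded rows and the list of rejects. In a real warehouse the same pattern would typically use the platform's bulk-load command and a partition or table swap in place of row inserts; the staging-then-publish split is the part that carries over.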
Summary
One massive CSV. Millions of timestamps.