# A public health-tech company has run on a 1.4 PB Snowflake warehouse plus a separate 6 PB raw S3 lak

Canonical URL: <https://datadriven.io/problems/a-public-health-tech-company-has-run-on-a-14-pb-snowflake-w-0e33f810>

Domain: Pipeline Design · Difficulty: medium

## Problem

A public health-tech company has run on a 1.4 PB Snowflake warehouse plus a separate 6 PB raw S3 lake for four years. Two answers exist for every metric and storage costs grew 4x year over year. The new chief data officer commits to a unified lakehouse architecture. Apply the entire L3 tier so a single lakehouse archive feeds the warehouse mart, ML training, and regulator audits, with the operational app keeping its own store and a compaction job keeping file size in target.

## Related

- [All practice problems](https://datadriven.io/problems)
- [Mock interview mode](https://datadriven.io/interview/a-public-health-tech-company-has-run-on-a-14-pb-snowflake-w-0e33f810)
- [System Design Interview Questions](https://datadriven.io/data-engineering-system-design)
- [Data Engineering Interview Prep Guide](https://datadriven.io/data-engineer-interview-prep)
- [Daily Challenge](https://datadriven.io/daily)

---

Source: DataDriven (https://datadriven.io). 100% free data engineering interview prep. Live code execution against Postgres 16, Python 3.11, and Spark sandboxes. No paywall, no premium tier, no signup gate.