# A retailer's revenue pipeline runs Flink streaming with a $4,000/month bill, feeding a CFO dashboard

Canonical URL: <https://datadriven.io/problems/a-retailers-revenue-pipeline-runs-flink-streaming-with-a-4-6b2d0346>

Domain: Pipeline Design · Difficulty: medium

## Problem

A retailer's revenue pipeline runs Flink streaming with a $4,000/month bill, feeding a CFO dashboard that the CFO reads once each morning at 7am Pacific. The streaming pipeline was built without a cost conversation, and the latency it provides has zero dollar value because the consumer reads once a day. Apply the cost-story framing this section just taught: the latency value is zero, streaming costs 5-50x more than equivalent batch, the right answer is to downgrade. Replace the Flink streaming transform with nightly batch (plain Spark, dbt, or PySpark are all batch tools that satisfy this), remove the local RocksDB state store (batch maintains no inter-run state), tag the warehouse mart with slaFreshness < 24h to match the actual consumer freshness need, and remove the real-time slaFreshness from the streaming nodes. The CFO dashboard's freshness need is unchanged; only the cost is.

## Related

- [All practice problems](https://datadriven.io/problems)
- [Mock interview mode](https://datadriven.io/interview/a-retailers-revenue-pipeline-runs-flink-streaming-with-a-4-6b2d0346)
- [System Design Interview Questions](https://datadriven.io/data-engineering-system-design)
- [Data Engineering Interview Prep Guide](https://datadriven.io/data-engineer-interview-prep)
- [Daily Challenge](https://datadriven.io/daily)

---

Source: DataDriven (https://datadriven.io). 100% free data engineering interview prep. Live code execution against Postgres 16, Python 3.11, and Spark sandboxes. No paywall, no premium tier, no signup gate.