DataDriven
LearnPracticeInterviewDiscussDailyJobs

A platform team gets a $48,000 monthly Snowflake bill with no breakdown by pipeline

A medium Pipeline Design interview practice problem on DataDriven. Write and execute real pipeline design code with instant grading.

Domain
Pipeline Design
Difficulty
medium

Problem

A platform team gets a $48,000 monthly Snowflake bill with no breakdown by pipeline. Nobody can say which pipelines should be optimized, which should be retired, or which are growing fastest. The section's pattern: thread a stable identifier through every query the warehouse runs (Snowflake QUERY_TAG, BigQuery job labels, Databricks tags, Redshift query groups). The minimum tag set is pipeline_id and run_id; team is useful for chargeback. Tag the query by replacing the dbt build transform with one whose name states the QUERY_TAG it sets at session start, and add a cost-rollup destination that reads QUERY_HISTORY and groups credits by pipeline.

Practice This Problem

Solve this Pipeline Design problem with real code execution. DataDriven runs your solution and grades it automatically.

Related

  • All Practice Problems
  • Mock Interview Mode
  • System Design Interview Questions
  • Data Engineering Interview Prep Guide
  • Daily Challenge
  • Data Engineering Lessons