Interview Guide · 2026

Amazon Data Engineer Interview in Washington DC (L5)

Hiring for Data Engineer at Amazon (L5) runs Leadership Principles woven into every round, with a Bar Raiser holding veto power. The hiring bar is shipped production pipelines end-to-end and can debug them when they break; the median candidate brings 2-5 years of DE experience. This guide covers the Washington DC (Washington DC / Arlington / Northern VA) hiring office, including local compensation bands and market context.

Compensation

$140K–$167K base • $207K–$261K total (L5)

Loop duration

3.8 hours onsite

Rounds

5 rounds

Location

Washington DC / Arlington / Northern VA

Compensation

Amazon Data Engineer in Washington DC total comp

Across 292 samples

Offer-report aggregate, 2020-2026. Level mapped: L5. Typical experience: 5-10 years (median 7).

25th percentile

$164K

Median total comp

$208K

75th percentile

$260K

Median base salary

$150K

Median annual equity

$47K

Median total comp by year

2020
$229K n=4
2021
$216K n=35
2022
$220K n=53
2023
$201K n=56
2024
$189K n=43
2025
$192K n=56
2026
$219K n=45

5 currently open data engineer postings in Washington DC.

Tech stack

What Amazon data engineers actually use

Across 5 open roles

What Amazon currently advertises as required for data engineer roles in Washington DC. Chips link into tool-specific interview guides.

S35Kinesis5Lambda5AWS5Redshift5EMR5Glue5SQL4Python4Scala2Java2SQS1PostgreSQL1Athena1

Round focus

Domain concentration by round

Across 5 job descriptions

Per-round concentration of each domain in Amazon's interview, derived from the skills emphasized across 5 current data engineer postings. Higher bars mean more questions of that type in that round.

Online Assessment

Python89%
SQL39%
Architecture16%

Phone Screen

SQL66%
Python66%
Architecture31%
Modeling9%

Onsite Loop

Architecture67%
Modeling33%
Python29%
SQL28%

Practice problems

Amazon data engineer practice set

4 problems

Amazon data engineer practice set, mapped from predicted domain emphasis. Tap into any problem to work it in the live environment.

Pythonmedium~20 min

The Coin Vault

Given a target amount and a list of coin denominations, return the minimum coins needed using a greedy strategy: repeatedly take the largest coin that does not exceed the remaining amount. Return -1 if the greedy approach cannot make exact change.

Open in practice environment
Modelingmedium~35 min

Machine Process Event Log Schema

We collect structured logs from a fleet of machines. Each machine runs many processes, and we need to track when each process runs and how long it takes. Data scientists need to query metrics like average elapsed time per process and plot process timelines across machines. Design the data model, and describe how you'd load this data via an ETL.

Open in practice environment
Architecturehard~10 min

City-Wide Bicycle Demand Forecasting Pipeline

We run a bike-share network across dozens of cities and need to predict hourly bike demand at each station so we can pre-position bikes before rush hour. The raw data comes from multiple city operators and external sources in different schemas and formats. Design the end-to-end pipeline from raw ingestion through to model-ready features.

Open in practice environment
Modelinghard~40 min

Content Engagement Data Model

We run a large social content platform. Creators publish posts (text, images, video). Users engage through views, reactions, comments, and shares. The product team needs a data model to power dashboards for content virality, creator performance, and feed ranking signals. Data visualization is also required. Sketch how a virality chart would query this model.

Open in practice environment
Try itTop 2 sellers by revenue in each marketplace

Classic DE round opener. Window function + partition. Edit to tweak the threshold.

top_sellers.sql
Click Run to execute. Edit the code above to experiment.

Washington DC / Arlington / Northern VA

Amazon in Washington DC

Amazon HQ2 anchors DE hiring. Gov-adjacent work (AWS GovCloud, defense tech) is common. Clearance-required roles pay a premium.

Offers in Washington DC typically trail the reference band by around 10%, reflecting a lower cost of living. Washington DC candidates run the same loop as global peers; the differences show up in team assignment and local comp calibration.

The loop

How the interview actually runs

01Recruiter screen

30 min

Logistics, team fit, and a light Leadership Principle question. Recruiters confirm seniority expectations before booking the loop. Misalignment here can downlevel the loop.

  • Have a 60-second pitch that names 2-3 concrete data systems you've built
  • Confirm the team. Amazon has hundreds of DE teams across AWS, Retail, Ads, Alexa, Prime Video, Pharmacy
  • Ask about the comp band early to avoid end-of-loop misalignment

02Technical phone screen

60 min

One SQL problem, one Python or pipeline design problem, and 10-15 min of Leadership Principle questions. The SQL is harder than the Online Assessment, expect multi-step window functions or self-joins.

  • Narrate approach before writing code. Amazon grades process, not just the final answer
  • Name the LP before telling the story
  • Prepare at least 2 stories per LP; follow-ups probe a third story on the same theme

03Onsite: SQL deep-dive

60 min

Two to three SQL problems with escalating difficulty, usually in Amazon contexts (seller performance, order fulfillment, inventory). Ends with 10 min of LP questions.

  • Practice window functions across large partition cardinalities
  • Be ready to rewrite correlated subqueries as joins and vice versa
  • When asked about optimization, mention partition pruning and columnar storage

04Onsite: Bar Raiser

60 min

An interviewer from outside the hiring team with veto power. Heaviest on Leadership Principles, with one harder technical problem. Tests whether you raise Amazon's hiring bar.

  • Bring a story where you were wrong and had to change course
  • Quantify impact: cost saved, latency reduced, users affected
  • If you don't know something, say so. Fabricating kills the loop faster than any technical gap

Level bar

What Amazon expects at Data Engineer

Pipeline ownership

Mid-level DEs own pipelines end-to-end. Interviewers expect stories about designing, deploying, and maintaining a data pipeline that has been in production for 6+ months.

SQL + Python or Spark fluency

SQL is the floor. Most teams also expect fluency in either Python for data manipulation (pandas, airflow DAGs) or Spark for larger-scale processing.

On-call debugging

You should have concrete stories about production incidents: what alert fired, how you diagnosed, what you fixed, and what post-mortem action you owned.

Amazon-specific emphasis

Amazon's loop is characterized by: Leadership Principles woven into every round, with a Bar Raiser holding veto power. Calibrate your preparation to that, generic FAANG prep will not close the gap on company-specific expectations.

Behavioral

How Amazon frames behavioral rounds

Dive Deep

The most relevant LP for data engineers. Amazon wants DEs who trace anomalies through 3+ layers of the stack instead of patching symptoms.

Tell me about a time you found a data quality issue that others had missed.

Ownership

You built it, you own it, including on-call and long-term maintenance. Ownership extends beyond your explicit scope when dependencies break.

Describe a situation where you went beyond your role to solve a problem.

Bias for Action

Speed beats perfection. Amazon wants DEs who ship V1 in 2 weeks rather than a perfect solution in 3 months.

Tell me about a time you made a decision without having all the information.

Earn Trust

Trust comes from delivery and transparency. Bar Raisers test whether you can admit mistakes and communicate setbacks without spinning.

Tell me about a time you delivered bad news to a stakeholder.

Prep timeline

Week-by-week preparation plan

8-10 weeks out
01

Foundations and gap analysis

  • ·Do 10 medium SQL problems. Note which patterns feel slow
  • ·Write out 2-3 behavioral stories per value, Amazon weights this round heavily
  • ·Read Amazon's public engineering blog for recent architecture patterns
  • ·Review your prior production work, pick 3-5 projects you can discuss in depth
6 weeks out
02

SQL and coding fluency

  • ·Practice window functions until DENSE_RANK, ROW_NUMBER, LAG, LEAD are reflex
  • ·Do 20+ Amazon-style problems in their domain
  • ·Time yourself: 25 min per medium, 35 min per hard
  • ·Record yourself narrating approach aloud, communication is graded
4 weeks out
03

Pipeline awareness and behavioral depth

  • ·Review pipeline architecture basics: idempotency, partitioning, backfill
  • ·Practice explaining a pipeline you've worked on end-to-end in 5 minutes
  • ·Refine behavioral stories based on mock feedback
  • ·Do 10 more SQL problems at medium difficulty
2 weeks out
04

Behavioral polish and mock loops

  • ·Rehearse every story out loud. Cut to 2-3 minutes each
  • ·Run 2 full mock loops with a mid-level DE or coach
  • ·Identify your 3 weakest behavioral areas and draft additional stories
  • ·Review recent Amazon news or earnings call for fresh talking points
Week of
05

Taper and logistics

  • ·No new content. Review your notes only
  • ·Sleep. Mental energy matters more than one more practice problem
  • ·Confirm logistics: laptop charged, shared-doc tool tested, snack and water nearby
  • ·Remember: interviewers want to find reasons to hire you, not to reject you

FAQ

Common questions

What level is Data Engineer at Amazon?
Data Engineer maps to L5 on Amazon's engineering ladder. This is an individual contributor level; expectations focus on shipped production pipelines end-to-end and can debug them when they break.
How much does a Amazon Data Engineer in Washington DC make?
Based on 292 offer samples covering 2020-2026, Amazon Data Engineer in Washington DC sees $164K at the 25th percentile, $208K at the median, and $260K at the 75th percentile, median base $150K and median annual equity $47K. Typical experience range: 5-10 years..
Does Amazon actually hire data engineers in Washington DC?
Yes, Amazon maintains a Washington DC office and hires Data Engineer data engineers there. Team assignment may be office-locked or global; confirm with the recruiter before the loop.
How is the Data Engineer loop different from other levels at Amazon?
The rounds look similar, but the bar calibrates to seniority. Data Engineer is evaluated on shipped production pipelines end-to-end and can debug them when they break. Questions at this level probe production pipeline ownership and on-call debugging.
How long should I prepare for the Amazon Data Engineer interview?
Plan for 6-8 weeks of prep if you're already a working DE. Under 4 weeks rushes the behavioral prep, which takes the most time.
Does Amazon interview data engineers differently than software engineers?
They differ meaningfully. Amazon's DE loop has heavier SQL, replaces the general system-design with a data-specific one (pipelines, warehouse design), and expects production data ops experience.

Continue your prep

Data Engineer Interview Prep, explore the full guide

50+ guides covering every round, company, role, and technology in the data engineer interview loop. Grounded in 2,817 verified interview reports across 929 companies, collected from real candidates.