Company Interview Guide

Airbnb Data Engineer Interview

Airbnb pioneered the modern data culture and created Apache Airflow. Their DE interviews test marketplace data modeling, experimentation infrastructure, and the ability to make data accessible across an entire organization. The process spans 3 to 5 weeks with leveling from L4 through L7.

Airbnb DE Compensation (2026)

Total compensation includes base salary, RSUs on a 4-year vest schedule, and annual refresh grants. Airbnb RSUs vest quarterly after the first year.

Ranges reflect total comp (base + RSUs + bonus) for US-based roles. Actual offers vary by location, experience, and negotiation.

IC1 (L4)Data Engineer
$160K to $240K

Entry-level for new grads or candidates with 1 to 3 years of experience

IC2 (L5)Senior Data Engineer
$260K to $400K

Most common hiring level for experienced candidates

IC3 (L6)Staff Data Engineer
$380K to $520K

Requires demonstrated cross-team technical leadership

IC4 (L7)Principal Data Engineer
$500K to $700K+

Rare external hires; typically internal promotions with org-wide impact

Airbnb DE Interview Process

Three stages from recruiter call to offer decision. Timeline: 3 to 5 weeks.

1

Recruiter Screen

30 min

Initial conversation about your experience and interest in Airbnb. The recruiter evaluates your background with data platforms, pipeline orchestration, and experimentation infrastructure. Airbnb has one of the strongest data cultures in tech, so they look for candidates who understand how data drives product decisions and can articulate the role of data engineering in enabling that culture.

*Airbnb created Apache Airflow; mention your experience with it or similar orchestration tools
*Show awareness of Airbnb's data culture: they train all employees on data literacy
*Ask about the team: Search, Payments, Trust and Safety, or Data Platform have different focuses
2

Technical Phone Screen

60 min

SQL problems set in a marketplace context: bookings, listings, reviews, search interactions. Airbnb phone screens test aggregation, window functions, and multi-step analytical queries. The interviewer evaluates clarity of thought and how you handle ambiguity in problem definitions. You may need to ask clarifying questions about business logic before writing queries.

*Practice marketplace SQL: two-sided metrics (host vs guest), conversion funnels, and seasonality
*Ask clarifying questions; Airbnb interviews reward candidates who define the problem before solving it
*Write readable SQL with meaningful aliases; Airbnb values code that others can maintain
3

Onsite Loop

5 to 6 hours

Five rounds covering SQL deep dive, system design, coding (Python), data modeling, and a core values interview. System design at Airbnb emphasizes experimentation platforms, search ranking data pipelines, and pricing analytics. The core values interview evaluates alignment with Airbnb's mission and how you champion the data culture. Data modeling often involves marketplace schemas that support both operational and analytical workloads.

*Know Airbnb's Minerva metrics layer; it standardizes metric definitions across the company
*System design should address experimentation: how the pipeline supports A/B tests and causal inference
*The core values round is serious; prepare stories about belonging, championing the mission, and hosting

Airbnb Data Tech Stack

The tools and systems Airbnb's data engineering org builds on.

Languages

Python, Java, Scala

Core Frameworks

Apache Spark, Apache Airflow (created at Airbnb), Apache Hive

Storage

S3, Parquet, custom data lake architecture

Metrics Layer

Minerva (Airbnb's centralized metrics platform, single source of truth for all metric definitions)

Query Engines

Presto/Trino, Spark SQL

Orchestration

Apache Airflow (internal fork with Airbnb-specific operators)

ML Platform

Bighead (Airbnb's end-to-end ML infrastructure)

Airbnb DE Team Structure

Data engineers at Airbnb are embedded across these major teams. Each has distinct technical focus areas and interview emphasis.

Search and Discovery

Ranking pipelines, search relevance features, listing quality signals, personalization data

Trust and Safety

Fraud detection, identity verification, risk scoring, trust signals across hosts and guests

Payments and Financial

Transaction pipelines, pricing models, payout reconciliation, tax and compliance data

Data Platform

Airflow infrastructure, Minerva development, data quality tooling, schema management

Host and Guest Analytics

Two-sided marketplace metrics, retention funnels, lifetime value models, engagement tracking

Pricing and Revenue Management

Dynamic pricing models, demand forecasting, occupancy optimization, smart pricing features

12 Example Questions with Guidance

Real question types from each round. The guidance shows what the interviewer evaluates and how to structure your answer.

SQL

Calculate the booking conversion rate by city and month: searches that resulted in a booking within 24 hours.

Join searches to bookings on user_id where booking_ts BETWEEN search_ts AND search_ts + INTERVAL 24 HOUR. Group by city and month. Discuss attribution: what if a user searched multiple cities before booking?

SQL

Find hosts whose average response time to booking requests increased by more than 50% compared to their previous quarter.

Calculate avg response time per host per quarter. Use LAG to get prior quarter. Filter where current > 1.5 * prior. Discuss how to handle hosts with very few requests (small sample sizes).

SQL

Identify listings that have never been booked but received more than 100 search impressions in the last 90 days.

LEFT JOIN listings to bookings, filter WHERE booking_id IS NULL. Join to search_impressions, group by listing, HAVING COUNT >= 100. Discuss what this indicates about listing quality or pricing.

SQL

Compute a rolling 7-day trust score for guests based on verified identity, reviews from hosts, response rate, and cancellation history.

Weight each signal (e.g., identity = 30%, host reviews = 30%, response rate = 20%, cancellations = 20%). Use window functions with ROWS BETWEEN 6 PRECEDING AND CURRENT ROW. Discuss how to handle new guests with no history and the cold-start problem.

Python

Write a data quality check that validates daily booking data: row count within expected range, no future dates, all required fields non-null, and referential integrity with listings.

Define check functions for each validation. Return a structured report with pass/fail per check. Discuss how to set dynamic thresholds (e.g., expected row count based on day of week and season) and how to handle check failures in a pipeline.

Python

Build a pipeline that detects anomalies in host pricing changes: sudden spikes during events, coordinated pricing across listings, or prices that deviate from neighborhood averages.

Compare each price change against historical distribution for that listing and its neighborhood. Use z-scores or IQR for outlier detection. Discuss how to distinguish legitimate event pricing from manipulation, and how to handle listings with limited price history.

System Design

Design Airbnb's experimentation data platform that computes metrics for thousands of concurrent A/B tests.

Event collection, experiment assignment logging, metric computation (pre-experiment and during), statistical analysis pipeline. Discuss metric standardization (Minerva), avoiding p-hacking with pre-registered metrics, and how to handle interaction effects between overlapping experiments.

System Design

Design a search ranking feature pipeline that provides real-time listing quality signals.

Batch features (historical booking rate, review scores) from Spark, real-time features (current availability, recent views) from a feature store. Discuss feature freshness requirements, how to backfill features for new listings, and serving latency constraints for search.

System Design

Design a dynamic pricing recommendation engine that suggests nightly rates to hosts based on demand, seasonality, local events, and comparable listings.

Ingest demand signals (search volume, booking pace), event calendars, and neighborhood comps. Batch compute base recommendations daily with Spark; update intra-day for demand spikes. Discuss how to handle new listings (cold start), host override behavior, and feedback loops where pricing affects demand.

Data Modeling

Model Airbnb's marketplace data to support both host analytics and guest conversion funnels.

Fact: bookings, searches, reviews, messages. Dimensions: listings, hosts, guests, locations. Discuss the two-sided marketplace: host metrics (occupancy rate, response time, revenue) vs guest metrics (search-to-book conversion, repeat booking rate). Define grain carefully for each fact table.

Data Modeling

Design a data model for Airbnb's host and guest funnel that tracks every step from initial search through post-stay review, supporting both real-time dashboards and weekly cohort analysis.

Define funnel stages: search impression, listing view, booking request, host response, booking confirmed, check-in, review submitted. Use a session-based fact table keyed on search_session_id. Discuss how to handle funnel re-entry (guest who views the same listing twice), attribution across devices, and how Minerva would define standard funnel metrics.

Behavioral

Tell me about a time you made data more accessible to non-technical stakeholders.

Airbnb's data culture means DEs serve analysts, PMs, and executives. Describe building self-serve tools, documentation, or training. Quantify impact: 'Reduced analyst time-to-answer from 2 days to 30 minutes' or 'Enabled 50 PMs to query experiment results independently.'

Common Mistakes in Airbnb DE Interviews

Patterns that consistently lead to rejections, and how to avoid them.

Treating Airbnb like a standard e-commerce company

Airbnb is a two-sided marketplace. Every metric has a host perspective and a guest perspective. Candidates who only think about the buyer side miss half the picture. When modeling bookings, consider occupancy rate (host metric) alongside conversion rate (guest metric).

Not understanding Minerva's role

Minerva is Airbnb's centralized metrics layer. It defines how every metric is calculated so that teams cannot produce conflicting numbers. If you propose a system design without mentioning metric standardization or a single source of truth, you are ignoring a core part of Airbnb's data philosophy.

Generic Airflow knowledge without depth

Airbnb created Airflow. Surface-level knowledge (DAGs, operators, scheduling) is the minimum. Interviewers expect you to discuss backfill strategies, idempotent task design, dynamic DAG generation, SLA monitoring, and how to handle task failures gracefully in production.

Ignoring seasonality and geographic variation

Airbnb's business is highly seasonal and location-dependent. A pipeline that works for New York in July will produce very different volumes for rural Japan in February. Always mention how your design handles variable data volumes, seasonal spikes, and regional differences.

Skipping the core values round preparation

The core values interview is a real evaluation round with veto power. Candidates who prepare only for technical rounds and wing the values discussion get rejected. Prepare specific stories that demonstrate belonging, hosting, and championing the mission.

What Makes Airbnb Different

Why preparing for Airbnb requires a different approach than other top-tier companies.

Airbnb created Apache Airflow

Airflow was born as an internal Airbnb project in 2014 before becoming an Apache top-level project. This means Airbnb's data infrastructure runs on a deeply customized fork with proprietary operators, monitoring, and SLA tooling. The internal engineering culture treats orchestration as a first-class discipline, not just a scheduler.

Minerva is the single source of truth for all metrics

Unlike most companies where metric definitions live in scattered dashboards and notebooks, Airbnb built Minerva to centralize every metric definition. DEs are responsible for registering metrics in Minerva and ensuring pipelines produce data that conforms to these definitions. This changes how you think about pipeline design: output is not just a table, it is a certified metric.

Two-sided marketplace complexity compounds everything

Most data engineering roles deal with one type of customer. At Airbnb, every feature, every metric, and every pipeline must account for both hosts and guests. A cancellation pipeline must update host availability, guest booking history, trust scores for both parties, and financial records. This multiplier effect makes even simple-sounding problems significantly harder.

Data culture is company-wide, not just an engineering initiative

Airbnb invests in data literacy training for all employees. Product managers, designers, and operations staff are expected to query data and interpret experiment results. DEs build infrastructure that serves the entire company, not just analysts. This means you are evaluated on how well you make data accessible, not just how well you build pipelines.

Airbnb-Specific Preparation Tips

Tactical advice for each aspect of the interview loop.

Airbnb created Airflow and has deep orchestration expertise

Apache Airflow was born at Airbnb. Interviewers expect you to understand DAG-based orchestration, dependency management, idempotent tasks, and retry strategies. If you use Airflow, know it well. If you use a competitor (Dagster, Prefect), be ready to compare.

Data culture is not a slogan, it is the operating model

Airbnb trains all employees on data literacy and has a centralized metrics layer (Minerva). DEs are expected to help the entire organization, not just serve other engineers. Prepare examples of making data accessible to non-technical users.

Marketplace metrics require two-sided thinking

Every Airbnb metric has a host side and a guest side. When modeling data, always consider both perspectives: a booking is revenue for the host and an experience for the guest. Interviewers notice candidates who naturally think about both sides.

Experimentation is central to product development

Airbnb runs thousands of experiments concurrently. DE pipelines compute metrics for these experiments. Understand how to design pipelines that support A/B testing: logging experiment assignment, computing treatment vs control metrics, and handling novelty effects.

Airbnb DE Interview FAQ

How many rounds are in an Airbnb DE interview?+
Typically 7: recruiter screen, technical phone screen, and 5 onsite rounds covering SQL deep dive, system design, coding (Python), data modeling, and core values. The process takes 3 to 5 weeks from first contact to offer.
Does Airbnb test Airflow knowledge specifically?+
Not in a hands-on coding way, but Airflow concepts (DAGs, operators, sensors, backfills, idempotency) come up naturally in system design and pipeline discussions. Since Airbnb created Airflow, understanding orchestration principles at depth is expected.
What makes Airbnb's data modeling round unique?+
Airbnb is a two-sided marketplace. The data modeling round tests whether you think about both host and guest perspectives simultaneously. You will likely model marketplace data that serves analytics for both sides plus trust and safety.
What is the Airbnb core values interview?+
A dedicated round evaluating alignment with Airbnb's mission ('belong anywhere') and values: champion the mission, be a host, embrace the adventure, be a cereal entrepreneur. Prepare stories that demonstrate these values authentically. This round has veto power.
What is Minerva and why does it matter for DE interviews?+
Minerva is Airbnb's centralized metrics layer. It defines how every metric is calculated, ensuring consistency across the company. In system design rounds, referencing metric standardization and a single source of truth shows you understand Airbnb's data philosophy.
What level does Airbnb typically hire data engineers at?+
Most external DE hires come in at L5 (Senior) or L6 (Staff). L4 is common for new grads or candidates with fewer than 3 years of experience. L7 (Principal) external hires are rare and typically require demonstrated org-wide impact at a previous company.
How does Airbnb's DE interview compare to other FAANG companies?+
Airbnb places more emphasis on data culture and experimentation infrastructure than most peers. The core values round is unique and carries real weight. Technical questions lean toward marketplace analytics rather than generic data engineering. The overall process is rigorous but well-structured.
Can I use Python instead of SQL in the technical rounds?+
SQL is expected for the SQL deep dive and phone screen. Python is used in the coding round. For system design and data modeling, you discuss architecture rather than write code. Airbnb interviewers value readable, well-structured SQL and idiomatic Python.

Prepare at Airbnb Interview Difficulty

Airbnb DE questions test marketplace thinking, Minerva-style metric standardization, and data culture. Practice with problems that mirror two-sided platform analytics.

Practice Airbnb-Level SQL

Continue your prep

Data Engineer Interview Prep, explore the full guide

50+ guides covering every round, company, role, and technology in the data engineer interview loop. Grounded in 2,817 verified interview reports across 921 companies, collected from real candidates.

Interview Rounds

By Company

By Role

By Technology

Decisions

Question Formats