Airbnb Data Engineer Interview

Airbnb pioneered the modern data culture and created Apache Airflow. Their DE interviews test marketplace data modeling, experimentation infrastructure, and the ability to make data accessible across an entire organization. The process spans 3 to 5 weeks with leveling from L4 through L7.

Airbnb

Travel · San Francisco, US · ABNB

live data · June 11, 2026

DE total comp

$380K–$530K

senior level · full ladder below

Hiring now

5 open DE roles

live from career pages

Team happiness

57 / 100 · Neutral

model score from employee signals

Layoff risk (30d)

Low

Employee sentiment

4.0 / 5

Mixed

Employees

5,001–50,000

Airbnb data engineer compensation

Industry ranges by level.

Level	Base	Total comp
JuniorL3	$130K–$160K	$175K–$225K
Mid-levelL4	$160K–$200K	$250K–$350K
SeniorL5	$200K–$250K	$380K–$530K
StaffL6	$240K–$300K	$500K–$700K
PrincipalL7	$285K–$360K	$700K–$1M

Airbnb DE Interview Process

Three stages from recruiter call to offer decision. Timeline: 3 to 5 weeks.

01
Recruiter Screen
Initial conversation about your experience and interest in Airbnb. The recruiter evaluates your background with data platforms, pipeline orchestration, and experimentation infrastructure. Airbnb has one of the strongest data cultures in tech, so they look for candidates who understand how data drives product decisions and can articulate the role of data engineering in enabling that culture.
- ▸Airbnb created Apache Airflow; mention your experience with it or similar orchestration tools
- ▸Show awareness of Airbnb's data culture: they train all employees on data literacy
- ▸Ask about the team: Search, Payments, Trust and Safety, or Data Platform have different focuses
02
Technical Phone Screen
SQL problems set in a marketplace context: bookings, listings, reviews, search interactions. Airbnb phone screens test aggregation, window functions, and multi-step analytical queries. The interviewer evaluates clarity of thought and how you handle ambiguity in problem definitions. You may need to ask clarifying questions about business logic before writing queries.
- ▸Practice marketplace SQL: two-sided metrics (host vs guest), conversion funnels, and seasonality
- ▸Ask clarifying questions; Airbnb interviews reward candidates who define the problem before solving it
- ▸Write readable SQL with meaningful aliases; Airbnb values code that others can maintain
03
Onsite Loop
Five rounds covering SQL deep dive, system design, coding (Python), data modeling, and a core values interview. System design at Airbnb emphasizes experimentation platforms, search ranking data pipelines, and pricing analytics. The core values interview evaluates alignment with Airbnb's mission and how you champion the data culture. Data modeling often involves marketplace schemas that support both operational and analytical workloads.
- ▸Know Airbnb's Minerva metrics layer; it standardizes metric definitions across the company
- ▸System design should address experimentation: how the pipeline supports A/B tests and causal inference
- ▸The core values round is serious; prepare stories about belonging, championing the mission, and hosting

The Airbnb data stack

What their data engineers work with day to day. Worth brushing up on the heavy hitters before the loop.

Languages

Scala5 SQL5Java4Python3

Tools and platforms

Spark5 BigQuery3 Flink3 Kafka3MySQL3PostgreSQL3Redshift3Glue1Airflow1AWS1

Airbnb DE Team Structure

Data engineers at Airbnb are embedded across these major teams. Each has distinct technical focus areas and interview emphasis.

Search and Discovery

Ranking pipelines, search relevance features, listing quality signals, personalization data

Trust and Safety

Fraud detection, identity verification, risk scoring, trust signals across hosts and guests

Payments and Financial

Transaction pipelines, pricing models, payout reconciliation, tax and compliance data

Data Platform

Airflow infrastructure, Minerva development, data quality tooling, schema management

Host and Guest Analytics

Two-sided marketplace metrics, retention funnels, lifetime value models, engagement tracking

Pricing and Revenue Management

Dynamic pricing models, demand forecasting, occupancy optimization, smart pricing features

Real Airbnb interview questions

Reported questions from this company's loops, tagged by domain, round, and level.

Pythononsite python· L52025

Given a list of side lengths, find the number of possible triangles that can be formed from those lengths.

Python coding question from Airbnb Data Engineer virtual onsite. Expected approach: for each triplet of sides (a, b, c) from the list, check the triangle inequality: a + b > c AND a + c > b AND b + c > a. Brute force: use itertools.combinations(sides, 3) to generate all triplets and count valid ones — O(n^3). Optimized: sort the array first, then for each pair (i, j), use binary search to find the count of valid third sides — O(n^2 log n). Edge cases: duplicate side lengths should still form distinct triangles if at different indices, and degenerate triangles (a + b == c) are typically…

Data Modelingonsite data modeling· L62025

Design a data model for a property booking platform including transaction, user, host, property, pricing, review, and geography tables with primary and foreign key relationships

Airbnb onsite data modeling round. Candidate presented with 5 business requirements and asked to develop a complete data model. Expected entities: transaction table (fact), users, hosts, properties, price table, reviews, geography, and time bucket tables. Interviewer probed on primary/foreign key decisions, normalization vs denormalization trade-offs, how to partition data for scale, and how to handle slowly changing dimensions for pricing. Follow-up: write SQL queries against the model. Corroborated across 3+ independent interview platforms.

SQLtechnical screen· L5

Write SQL queries on different tables

Pythononsite python· L52025

Given a list of file paths, build a nested dictionary representing the directory tree structure

Write a function build_file_tree(paths) that takes a list of file path strings and returns a nested dictionary representing the directory structure. Files (leaf nodes) should have a value of None. Paths are always relative (no leading '/'), contain no '..' or '.' segments, and have no trailing slashes. Example: paths = ['src/main.py', 'src/utils/helper.py', 'README.md', 'src/utils/config.py'] Output: { 'src': { 'main.py': None, 'utils': { 'helper.py': None, 'config.py': None } }, 'README.md': None } paths = ['a/b/c/d.txt'] Output:…

Pythononsite python· L52025

Write a function that converts a flat list of parent-child pairs into a nested dictionary tree

Write a function build_tree(pairs) where each pair is a tuple (parent, child). Return the tree as a nested dictionary. Assume a single root node (one value that appears as a parent but never as a child), no cycles, and each child has exactly one parent. Leaf nodes map to an empty dict. Example: pairs = [('A', 'B'), ('A', 'C'), ('B', 'D'), ('B', 'E'), ('C', 'F')] Output: {'A': {'B': {'D': {}, 'E': {}}, 'C': {'F': {}}}} pairs = [('root', 'a'), ('root', 'b')] Output: {'root': {'a': {}, 'b': {}}} pairs = [('x', 'y'), ('y', 'z')] Output: {'x': {'y': {'z': {}}}} Edge cases: single…

Common Mistakes in Airbnb DE Interviews

Patterns that consistently lead to rejections, and how to avoid them.

Treating Airbnb like a standard e-commerce company

Airbnb is a two-sided marketplace. Every metric has a host perspective and a guest perspective. Candidates who only think about the buyer side miss half the picture. When modeling bookings, consider occupancy rate (host metric) alongside conversion rate (guest metric).

Not understanding Minerva's role

Minerva is Airbnb's centralized metrics layer. It defines how every metric is calculated so that teams cannot produce conflicting numbers. If you propose a system design without mentioning metric standardization or a single source of truth, you are ignoring a core part of Airbnb's data philosophy.

Generic Airflow knowledge without depth

Airbnb created Airflow. Surface-level knowledge (DAGs, operators, scheduling) is the minimum. Interviewers expect you to discuss backfill strategies, idempotent task design, dynamic DAG generation, SLA monitoring, and how to handle task failures gracefully in production.

Ignoring seasonality and geographic variation

Airbnb's business is highly seasonal and location-dependent. A pipeline that works for New York in July will produce very different volumes for rural Japan in February. Always mention how your design handles variable data volumes, seasonal spikes, and regional differences.

Skipping the core values round preparation

The core values interview is a real evaluation round with veto power. Candidates who prepare only for technical rounds and wing the values discussion get rejected. Prepare specific stories that demonstrate belonging, hosting, and championing the mission.

What Makes Airbnb Different

Why preparing for Airbnb requires a different approach than other top-tier companies.

Airbnb created Apache Airflow

Airflow was born as an internal Airbnb project in 2014 before becoming an Apache top-level project. This means Airbnb's data infrastructure runs on a deeply customized fork with proprietary operators, monitoring, and SLA tooling. The internal engineering culture treats orchestration as a first-class discipline, not just a scheduler.

Minerva is the single source of truth for all metrics

Unlike most companies where metric definitions live in scattered dashboards and notebooks, Airbnb built Minerva to centralize every metric definition. DEs are responsible for registering metrics in Minerva and ensuring pipelines produce data that conforms to these definitions. This changes how you think about pipeline design: output is not just a table, it is a certified metric.

Two-sided marketplace complexity compounds everything

Most data engineering roles deal with one type of customer. At Airbnb, every feature, every metric, and every pipeline must account for both hosts and guests. A cancellation pipeline must update host availability, guest booking history, trust scores for both parties, and financial records. This multiplier effect makes even simple-sounding problems significantly harder.

Data culture is company-wide, not just an engineering initiative

Airbnb invests in data literacy training for all employees. Product managers, designers, and operations staff are expected to query data and interpret experiment results. DEs build infrastructure that serves the entire company, not just analysts. This means you are evaluated on how well you make data accessible, not just how well you build pipelines.

Airbnb-Specific Preparation Tips

Tactical advice for each aspect of the interview loop.

Airbnb created Airflow and has deep orchestration expertise

Apache Airflow was born at Airbnb. Interviewers expect you to understand DAG-based orchestration, dependency management, idempotent tasks, and retry strategies. If you use Airflow, know it well. If you use a competitor (Dagster, Prefect), be ready to compare.

Data culture is not a slogan, it is the operating model

Airbnb trains all employees on data literacy and has a centralized metrics layer (Minerva). DEs are expected to help the entire organization, not just serve other engineers. Prepare examples of making data accessible to non-technical users.

Marketplace metrics require two-sided thinking

Every Airbnb metric has a host side and a guest side. When modeling data, always consider both perspectives: a booking is revenue for the host and an experience for the guest. Interviewers notice candidates who naturally think about both sides.

Experimentation is central to product development

Airbnb runs thousands of experiments concurrently. DE pipelines compute metrics for these experiments. Understand how to design pipelines that support A/B testing: logging experiment assignment, computing treatment vs control metrics, and handling novelty effects.

Airbnb practice set

Problems on the platform tagged and predicted for Airbnb loops, from live listings and interview reports.

SQLeasy~5 min

Full Customer Order List

Return first_name, last_name, and country for every customer in customers. Sort alphabetically by first_name, then last_name.

Pythonmedium~10 min

Detect Cycle in Sequence

You are given a list of integers where each value at index i is the next index to visit (or -1 to terminate). Starting from index 0, follow the chain and return True if you revisit any index, False otherwise. Out-of-range indices (including -1) count as termination, not a cycle.

SQLeasy~5 min

High Volume Batch Jobs

Surface all batch jobs that processed more than 5000 rows, showing each job's name, priority, and rows processed, ranked from most to fewest.

Pythoneasy~10 min

The Bitwise Judge

Given an integer n (possibly negative), return True if n is even, False if odd. Solve using bitwise operations only - no %, no /, no //.

SQLmedium~5 min

Active Duo

The growth team is building a cross-engagement segment of users who both make purchases and log browsing sessions on the platform. Return a deduplicated list of usernames for users with activity in both areas.

Pythoneasy~10 min

Quantile Calculator

Given a list of numbers and percentile (0-100), return the value at that percentile using linear interpolation. The index is percentile / 100 * (n - 1); if fractional, linearly interpolate between the floor and ceiling indices of the sorted values.

Airbnb DE Interview FAQ

How many rounds are in an Airbnb DE interview?+

Typically 7: recruiter screen, technical phone screen, and 5 onsite rounds covering SQL deep dive, system design, coding (Python), data modeling, and core values. The process takes 3 to 5 weeks from first contact to offer.

Does Airbnb test Airflow knowledge specifically?+

Not in a hands-on coding way, but Airflow concepts (DAGs, operators, sensors, backfills, idempotency) come up naturally in system design and pipeline discussions. Since Airbnb created Airflow, understanding orchestration principles at depth is expected.

What makes Airbnb's data modeling round unique?+

Airbnb is a two-sided marketplace. The data modeling round tests whether you think about both host and guest perspectives simultaneously. You will likely model marketplace data that serves analytics for both sides plus trust and safety.

What is the Airbnb core values interview?+

A dedicated round evaluating alignment with Airbnb's mission ('belong anywhere') and values: champion the mission, be a host, embrace the adventure, be a cereal entrepreneur. Prepare stories that demonstrate these values authentically. This round has veto power.

What is Minerva and why does it matter for DE interviews?+

Minerva is Airbnb's centralized metrics layer. It defines how every metric is calculated, ensuring consistency across the company. In system design rounds, referencing metric standardization and a single source of truth shows you understand Airbnb's data philosophy.

What level does Airbnb typically hire data engineers at?+

Most external DE hires come in at L5 (Senior) or L6 (Staff). L4 is common for new grads or candidates with fewer than 3 years of experience. L7 (Principal) external hires are rare and typically require demonstrated org-wide impact at a previous company.

How does Airbnb's DE interview compare to other FAANG companies?+

Airbnb places more emphasis on data culture and experimentation infrastructure than most peers. The core values round is unique and carries real weight. Technical questions lean toward marketplace analytics rather than generic data engineering. The overall process is rigorous but well-structured.

Can I use Python instead of SQL in the technical rounds?+

SQL is expected for the SQL deep dive and phone screen. Python is used in the coding round. For system design and data modeling, you discuss architecture rather than write code. Airbnb interviewers value readable, well-structured SQL and idiomatic Python.

02 / Why practice

Prepare at Airbnb Interview Difficulty

01
Active recall beats re-reading by 50%
Cognitive-science meta-reviews (Dunlosky et al., 2013) rank practice testing as a top-tier study technique, while re-reading and highlighting rank near the bottom
02
76% of hiring managers reject on the coding task, not the resume
From HackerRank's 2024 Developer Skills Report. Candidates who look strong on paper still fail the live screen if they haven't done timed, executable practice
03
Five problem shapes cover 80% of data engineer loops
Dedup, sessionization, top-N-per-group, slowly-changing dimensions, partition tricks. Writing the shapes by hand turns the unfamiliar into pattern recognition

Practice Airbnb-Level SQL

Related Guides

DE Interview Prep Guide→

Complete preparation framework for data engineering interviews

Data Modeling Questions→

Star schema, SCD, grain definition, and normalization

SQL Interview Questions→

Every SQL topic tested in DE interviews with frequency data

Airbnb Data Engineer Interview

Airbnb data engineer compensation

Airbnb DE Interview Process

Recruiter Screen

Technical Phone Screen

Onsite Loop

The Airbnb data stack

Airbnb DE Team Structure

Search and Discovery

Trust and Safety

Payments and Financial

Data Platform

Host and Guest Analytics

Pricing and Revenue Management

Real Airbnb interview questions

Common Mistakes in Airbnb DE Interviews

Treating Airbnb like a standard e-commerce company

Not understanding Minerva's role

Generic Airflow knowledge without depth

Ignoring seasonality and geographic variation

Skipping the core values round preparation

What Makes Airbnb Different

Airbnb created Apache Airflow

Minerva is the single source of truth for all metrics

Two-sided marketplace complexity compounds everything

Data culture is company-wide, not just an engineering initiative

Airbnb-Specific Preparation Tips

Airbnb created Airflow and has deep orchestration expertise

Data culture is not a slogan, it is the operating model

Marketplace metrics require two-sided thinking

Experimentation is central to product development

Airbnb practice set

Airbnb DE Interview FAQ

Prepare at Airbnb Interview Difficulty

Related Guides