Airbnb Data Engineer Interview (2026)
Airbnb pioneered the modern data culture and created Apache Airflow. Their DE interviews test marketplace data modeling, experimentation infrastructure, and the ability to make data accessible across an entire organization. The process spans 3 to 5 weeks with leveling from L4 through L7.
Airbnb DE Compensation (2026)
Total compensation includes base salary, RSUs on a 4-year vest schedule, and annual refresh grants. Airbnb RSUs vest quarterly after the first year.
IC1 (L4) · Data Engineer
$160K to $240K total comp. Entry-level for new grads or candidates with 1 to 3 years of experience.
IC2 (L5) · Senior Data Engineer
$260K to $400K total comp. Most common hiring level for experienced candidates.
IC3 (L6) · Staff Data Engineer
$380K to $520K total comp. Requires demonstrated cross-team technical leadership.
IC4 (L7) · Principal Data Engineer
$500K to $700K+ total comp. Rare external hires; typically internal promotions with org-wide impact.
Airbnb DE Interview Process
Three stages from recruiter call to offer decision. Timeline: 3 to 5 weeks.
- 01
Recruiter Screen
Initial conversation about your experience and interest in Airbnb. The recruiter evaluates your background with data platforms, pipeline orchestration, and experimentation infrastructure. Airbnb has one of the strongest data cultures in tech, so they look for candidates who understand how data drives product decisions and can articulate the role of data engineering in enabling that culture.
- ▸Airbnb created Apache Airflow; mention your experience with it or similar orchestration tools
- ▸Show awareness of Airbnb's data culture: they train all employees on data literacy
- ▸Ask about the team: Search, Payments, Trust and Safety, or Data Platform have different focuses
- 02
Technical Phone Screen
SQL problems set in a marketplace context: bookings, listings, reviews, search interactions. Airbnb phone screens test aggregation, window functions, and multi-step analytical queries. The interviewer evaluates clarity of thought and how you handle ambiguity in problem definitions. You may need to ask clarifying questions about business logic before writing queries.
- ▸Practice marketplace SQL: two-sided metrics (host vs guest), conversion funnels, and seasonality
- ▸Ask clarifying questions; Airbnb interviews reward candidates who define the problem before solving it
- ▸Write readable SQL with meaningful aliases; Airbnb values code that others can maintain
- 03
Onsite Loop
Five rounds covering SQL deep dive, system design, coding (Python), data modeling, and a core values interview. System design at Airbnb emphasizes experimentation platforms, search ranking data pipelines, and pricing analytics. The core values interview evaluates alignment with Airbnb's mission and how you champion the data culture. Data modeling often involves marketplace schemas that support both operational and analytical workloads.
- ▸Know Airbnb's Minerva metrics layer; it standardizes metric definitions across the company
- ▸System design should address experimentation: how the pipeline supports A/B tests and causal inference
- ▸The core values round is serious; prepare stories about belonging, championing the mission, and hosting
Airbnb Data Tech Stack
The tools and systems Airbnb's data engineering org builds on.
| Category | Technologies |
|---|---|
| Languages | Python, Java, Scala |
| Core Frameworks | Apache Spark, Apache Airflow (created at Airbnb), Apache Hive |
| Storage | S3, Parquet, custom data lake architecture |
| Metrics Layer | Minerva (Airbnb's centralized metrics platform, single source of truth for all metric definitions) |
| Query Engines | Presto/Trino, Spark SQL |
| Orchestration | Apache Airflow (internal fork with Airbnb-specific operators) |
| ML Platform | Bighead (Airbnb's end-to-end ML infrastructure) |
Problems sourced from real Airbnb interview reports. Run your code in the browser.
Airbnb DE Team Structure
Data engineers at Airbnb are embedded across these major teams. Each has distinct technical focus areas and interview emphasis.
Search and Discovery
Ranking pipelines, search relevance features, listing quality signals, personalization data
Trust and Safety
Fraud detection, identity verification, risk scoring, trust signals across hosts and guests
Payments and Financial
Transaction pipelines, pricing models, payout reconciliation, tax and compliance data
Data Platform
Airflow infrastructure, Minerva development, data quality tooling, schema management
Host and Guest Analytics
Two-sided marketplace metrics, retention funnels, lifetime value models, engagement tracking
Pricing and Revenue Management
Dynamic pricing models, demand forecasting, occupancy optimization, smart pricing features
12 Example Questions with Guidance
Real question types from each round. The guidance shows what the interviewer evaluates and how to structure your answer.
Calculate the booking conversion rate by city and month: searches that resulted in a booking within 24 hours.
Join searches to bookings on user_id where booking_ts BETWEEN search_ts AND search_ts + INTERVAL 24 HOUR. Group by city and month. Discuss attribution: what if a user searched multiple cities before booking?
Find hosts whose average response time to booking requests increased by more than 50% compared to their previous quarter.
Calculate avg response time per host per quarter. Use LAG to get prior quarter. Filter where current > 1.5 * prior. Discuss how to handle hosts with very few requests (small sample sizes).
Identify listings that have never been booked but received more than 100 search impressions in the last 90 days.
LEFT JOIN listings to bookings, filter WHERE booking_id IS NULL. Join to search_impressions, group by listing, HAVING COUNT >= 100. Discuss what this indicates about listing quality or pricing.
Compute a rolling 7-day trust score for guests based on verified identity, reviews from hosts, response rate, and cancellation history.
Weight each signal (e.g., identity = 30%, host reviews = 30%, response rate = 20%, cancellations = 20%). Use window functions with ROWS BETWEEN 6 PRECEDING AND CURRENT ROW. Discuss how to handle new guests with no history and the cold-start problem.
Write a data quality check that validates daily booking data: row count within expected range, no future dates, all required fields non-null, and referential integrity with listings.
Define check functions for each validation. Return a structured report with pass/fail per check. Discuss how to set dynamic thresholds (e.g., expected row count based on day of week and season) and how to handle check failures in a pipeline.
Build a pipeline that detects anomalies in host pricing changes: sudden spikes during events, coordinated pricing across listings, or prices that deviate from neighborhood averages.
Compare each price change against historical distribution for that listing and its neighborhood. Use z-scores or IQR for outlier detection. Discuss how to distinguish legitimate event pricing from manipulation, and how to handle listings with limited price history.
Design Airbnb's experimentation data platform that computes metrics for thousands of concurrent A/B tests.
Event collection, experiment assignment logging, metric computation (pre-experiment and during), statistical analysis pipeline. Discuss metric standardization (Minerva), avoiding p-hacking with pre-registered metrics, and how to handle interaction effects between overlapping experiments.
Design a search ranking feature pipeline that provides real-time listing quality signals.
Batch features (historical booking rate, review scores) from Spark, real-time features (current availability, recent views) from a feature store. Discuss feature freshness requirements, how to backfill features for new listings, and serving latency constraints for search.
Design a dynamic pricing recommendation engine that suggests nightly rates to hosts based on demand, seasonality, local events, and comparable listings.
Ingest demand signals (search volume, booking pace), event calendars, and neighborhood comps. Batch compute base recommendations daily with Spark; update intra-day for demand spikes. Discuss how to handle new listings (cold start), host override behavior, and feedback loops where pricing affects demand.
Model Airbnb's marketplace data to support both host analytics and guest conversion funnels.
Fact: bookings, searches, reviews, messages. Dimensions: listings, hosts, guests, locations. Discuss the two-sided marketplace: host metrics (occupancy rate, response time, revenue) vs guest metrics (search-to-book conversion, repeat booking rate). Define grain carefully for each fact table.
Design a data model for Airbnb's host and guest funnel that tracks every step from initial search through post-stay review, supporting both real-time dashboards and weekly cohort analysis.
Define funnel stages: search impression, listing view, booking request, host response, booking confirmed, check-in, review submitted. Use a session-based fact table keyed on search_session_id. Discuss how to handle funnel re-entry (guest who views the same listing twice), attribution across devices, and how Minerva would define standard funnel metrics.
Tell me about a time you made data more accessible to non-technical stakeholders.
Airbnb's data culture means DEs serve analysts, PMs, and executives. Describe building self-serve tools, documentation, or training. Quantify impact: 'Reduced analyst time-to-answer from 2 days to 30 minutes' or 'Enabled 50 PMs to query experiment results independently.'
Common Mistakes in Airbnb DE Interviews
Patterns that consistently lead to rejections, and how to avoid them.
Treating Airbnb like a standard e-commerce company
Airbnb is a two-sided marketplace. Every metric has a host perspective and a guest perspective. Candidates who only think about the buyer side miss half the picture. When modeling bookings, consider occupancy rate (host metric) alongside conversion rate (guest metric).
Not understanding Minerva's role
Minerva is Airbnb's centralized metrics layer. It defines how every metric is calculated so that teams cannot produce conflicting numbers. If you propose a system design without mentioning metric standardization or a single source of truth, you are ignoring a core part of Airbnb's data philosophy.
Generic Airflow knowledge without depth
Airbnb created Airflow. Surface-level knowledge (DAGs, operators, scheduling) is the minimum. Interviewers expect you to discuss backfill strategies, idempotent task design, dynamic DAG generation, SLA monitoring, and how to handle task failures gracefully in production.
Ignoring seasonality and geographic variation
Airbnb's business is highly seasonal and location-dependent. A pipeline that works for New York in July will produce very different volumes for rural Japan in February. Always mention how your design handles variable data volumes, seasonal spikes, and regional differences.
Skipping the core values round preparation
The core values interview is a real evaluation round with veto power. Candidates who prepare only for technical rounds and wing the values discussion get rejected. Prepare specific stories that demonstrate belonging, hosting, and championing the mission.
What Makes Airbnb Different
Why preparing for Airbnb requires a different approach than other top-tier companies.
Airbnb created Apache Airflow
Airflow was born as an internal Airbnb project in 2014 before becoming an Apache top-level project. This means Airbnb's data infrastructure runs on a deeply customized fork with proprietary operators, monitoring, and SLA tooling. The internal engineering culture treats orchestration as a first-class discipline, not just a scheduler.
Minerva is the single source of truth for all metrics
Unlike most companies where metric definitions live in scattered dashboards and notebooks, Airbnb built Minerva to centralize every metric definition. DEs are responsible for registering metrics in Minerva and ensuring pipelines produce data that conforms to these definitions. This changes how you think about pipeline design: output is not just a table, it is a certified metric.
Two-sided marketplace complexity compounds everything
Most data engineering roles deal with one type of customer. At Airbnb, every feature, every metric, and every pipeline must account for both hosts and guests. A cancellation pipeline must update host availability, guest booking history, trust scores for both parties, and financial records. This multiplier effect makes even simple-sounding problems significantly harder.
Data culture is company-wide, not just an engineering initiative
Airbnb invests in data literacy training for all employees. Product managers, designers, and operations staff are expected to query data and interpret experiment results. DEs build infrastructure that serves the entire company, not just analysts. This means you are evaluated on how well you make data accessible, not just how well you build pipelines.
Airbnb-Specific Preparation Tips
Tactical advice for each aspect of the interview loop.
Airbnb created Airflow and has deep orchestration expertise
Apache Airflow was born at Airbnb. Interviewers expect you to understand DAG-based orchestration, dependency management, idempotent tasks, and retry strategies. If you use Airflow, know it well. If you use a competitor (Dagster, Prefect), be ready to compare.
Data culture is not a slogan, it is the operating model
Airbnb trains all employees on data literacy and has a centralized metrics layer (Minerva). DEs are expected to help the entire organization, not just serve other engineers. Prepare examples of making data accessible to non-technical users.
Marketplace metrics require two-sided thinking
Every Airbnb metric has a host side and a guest side. When modeling data, always consider both perspectives: a booking is revenue for the host and an experience for the guest. Interviewers notice candidates who naturally think about both sides.
Experimentation is central to product development
Airbnb runs thousands of experiments concurrently. DE pipelines compute metrics for these experiments. Understand how to design pipelines that support A/B testing: logging experiment assignment, computing treatment vs control metrics, and handling novelty effects.
Airbnb DE Interview FAQ
How many rounds are in an Airbnb DE interview?+
Does Airbnb test Airflow knowledge specifically?+
What makes Airbnb's data modeling round unique?+
What is the Airbnb core values interview?+
What is Minerva and why does it matter for DE interviews?+
What level does Airbnb typically hire data engineers at?+
How does Airbnb's DE interview compare to other FAANG companies?+
Can I use Python instead of SQL in the technical rounds?+
Prepare at Airbnb Interview Difficulty
Airbnb DE questions test marketplace thinking, Minerva-style metric standardization, and data culture. Practice with problems that mirror two-sided platform analytics.