How to Prep When Every Company Runs a Different Loop
Unpredictability demands portfolio depth, not specialization. Betting on format is a losing strategy. Instead, you need credible competence across five domains, then selectively deepen based on signals during recruiter calls.
SQL is non-negotiable. Window functions (RANK, DENSE_RANK, ROW_NUMBER with frame specifications) appear in roughly 80% of technical screens. They're the entry-level filter. If you can't write this cold, you won't advance to system design:
SELECT
user_id,
event_date,
revenue,
SUM(revenue) OVER (
PARTITION BY user_id
ORDER BY event_date
ROWS BETWEEN 6 PRECEDING AND CURRENT ROW
) AS rolling_7d_revenue,
LAG(revenue, 1) OVER (
PARTITION BY user_id
ORDER BY event_date
) AS prev_day_revenue
FROM daily_user_metrics
WHERE event_date >= CURRENT_DATE - INTERVAL '90 days';
Get reps on real practice problems that force you to think about edge cases, not just syntax.
Python rounds went pragmatic. Nobody is asking you to implement a trie. They want to see you read JSON, write to S3, handle errors, and reason about failure modes. Dataclass fluency, not algorithm fluency.
Data modeling is the leveling mechanism. With DSA gone, senior vs. junior distinction now lives in whether you can explain trade-offs: dimensional modeling for siloed sources, grain decisions, why you'd pick a wide denormalized table over a star schema given current storage economics. Modern interviews reward the candidate who slows down, clarifies scope, defines grain, and turns an ambiguous prompt into a decision-ready model.
Business context matters more than execution perfection. Candidates who ask "What volume? Latency? Cost constraints?" before architecting a pipeline outrank those who jump straight to Kafka + Spark. Default to batch unless latency requirement is under 5 minutes (fraud detection, real-time bidding, CDC). Know why Kappa architecture (single streaming pipeline with batch replays) is preferred over Lambda in most modern stacks. This is batch vs. streaming reasoning, not framework trivia.
Ask the recruiter what the loop looks like. Seriously. "Can you walk me through the interview stages and what each round focuses on?" is a question every candidate should ask on the first call. Airbnb runs 5 to 7 total rounds with system design as the leveling determinant. Uber runs the heaviest data modeling round in the industry. Stripe splits Python for a second system design. Knowing this before you prep is worth more than 50 hours of unfocused study.