About DataDriven Mock Interviews

DataDriven is a free web application that simulates all four rounds of a data engineering interview: SQL, Python, Data Modeling, and Pipeline Architecture. Each domain can be practiced in two modes: Problem mode (self-paced with instant grading) and Interview mode (timed AI mock interview simulation).

Interview mode has four phases. Phase 1 (Think): you receive a deliberately vague prompt and ask clarifying questions to an AI interviewer, who responds like a real hiring manager. Phase 2 (Code/Design): you write SQL against a real PostgreSQL database, Python in a Docker sandbox, or build schemas/pipelines on an interactive canvas. Phase 3 (Discuss): the AI interviewer asks follow-up questions about your solution, one question at a time. You respond, and it asks another. This continues for up to 8 exchanges. The interviewer probes edge cases, optimization, alternative approaches, and may introduce curveball requirements that change the problem mid-interview. Phase 4 (Verdict): you receive a hire/no-hire decision with specific feedback on what you did well, where your reasoning had gaps, and what to study next.

Features: adaptive difficulty (problems scale to your performance), spaced repetition (weak concepts resurface at optimal intervals), readiness score (per-topic gap tracker), company-specific filtering (Google, Amazon, Meta, Stripe, Databricks, weighted by real interview data), and seniority calibration (Junior through Staff). 100% free, no trial, no credit card, no paywall.

Data Engineering Mock Interview Questions

1468+ data engineering mock interview questions with AI-powered feedback. Pick your domain, target company tier, and seniority level to start a timed interview simulation. Write real code, ask clarifying questions, and get graded instantly.

Available domains: Python (387 questions), SQL (903 questions), Data Modeling (56 questions), Architecture (121 questions), Spark (1 question). Difficulty levels: easy (534), medium (677), hard (257). Seniority levels: Junior, Mid, Senior, Staff, Sr. Staff.
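
As a quick sanity check on the figures above, the per-domain and per-difficulty counts can be summed and compared with the headline total. This is only an illustrative sketch; the dictionary names are made up for the example and are not part of DataDriven.

```python
# Question counts copied from the listing above.
by_domain = {
    "Python": 387,
    "SQL": 903,
    "Data Modeling": 56,
    "Architecture": 121,
    "Spark": 1,
}
by_difficulty = {"easy": 534, "medium": 677, "hard": 257}

total_by_domain = sum(by_domain.values())
total_by_difficulty = sum(by_difficulty.values())

# Both breakdowns should describe the same pool of questions.
print(total_by_domain, total_by_difficulty)  # 1468 1468
```

Both breakdowns add up to 1468, which matches the advertised total.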

How Interview Mode Works (Four Phases)

  1. Phase 1 (Think): Choose a domain (SQL, Python, Data Modeling, or Pipeline Architecture), select your seniority level (Junior through Staff) and target company tier. You receive a deliberately vague prompt. Ask clarifying questions to the AI interviewer, who responds like a real hiring manager.
  2. Phase 2 (Code/Design): Write and execute your solution. SQL runs against a real PostgreSQL database. Python runs in a Docker sandbox. Data Modeling uses an interactive schema canvas. Pipeline Architecture uses an interactive design canvas.
  3. Phase 3 (Discuss): The AI interviewer asks follow-up questions about your solution, one at a time: you answer, and it asks the next. This continues for up to 8 exchanges. The interviewer probes edge cases, tests your awareness of optimization, asks about alternative approaches, and may introduce curveball requirements that change the problem mid-interview.
  4. Phase 4 (Verdict): Receive a hire/no-hire decision with specific feedback on what you did well, where your reasoning had gaps, what the interviewer was testing, and what to study next.
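
The four phases above can be read as a small state machine. The sketch below is one way to model that flow, assuming only what the list states: the phase order and the cap of 8 discussion exchanges. The names `Phase`, `next_phase`, and `interviewer_done` are hypothetical and do not describe DataDriven's actual implementation.

```python
from enum import Enum

# "Up to 8 exchanges" in the Discuss phase, per the description above.
MAX_DISCUSS_EXCHANGES = 8


class Phase(Enum):
    THINK = 1
    CODE_OR_DESIGN = 2
    DISCUSS = 3
    VERDICT = 4


def next_phase(phase, exchanges_done=0, interviewer_done=False):
    """Return the phase that follows `phase` (hypothetical helper).

    `exchanges_done` counts completed question/answer pairs in Discuss.
    `interviewer_done` is an assumed flag for the interviewer ending early.
    """
    if phase is Phase.THINK:
        return Phase.CODE_OR_DESIGN
    if phase is Phase.CODE_OR_DESIGN:
        return Phase.DISCUSS
    if phase is Phase.DISCUSS:
        # Stay in Discuss until the cap is reached or the interviewer stops.
        if interviewer_done or exchanges_done >= MAX_DISCUSS_EXCHANGES:
            return Phase.VERDICT
        return Phase.DISCUSS
    return Phase.VERDICT  # Verdict is terminal.
```

For example, `next_phase(Phase.DISCUSS, exchanges_done=3)` stays in `Phase.DISCUSS`, while `next_phase(Phase.DISCUSS, exchanges_done=8)` moves on to `Phase.VERDICT`.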

How Problem Mode Works

Problem mode is self-paced practice with clear problem statements and instant grading. No AI interviewer, no timer, no discussion phase. Focus on building skill before testing it under interview pressure.