Frame it
- Two-stage funnel: candidate generation (retrieve ~1000 from millions) then ranking (score the 1000).
- Frame ranking as predicting P(engagement) — watch, click, purchase.
The round that decides mid & senior offers — and the one with the least free material. A repeatable framework plus 10 worked case studies: recommendations, ranking, fraud, search, RAG, churn, ETA, moderation, and forecasting.
Pin down the business goal, the user, scale (QPS, users, items), latency budget, and what 'success' means before touching models. Restate it back to the interviewer.
Map the business goal to a concrete ML formulation — classification, ranking, regression, retrieval, generation — and define the label. The hardest part is often where the label comes from.
Separate the offline metric (AUC, NDCG, RMSE, recall@k) from the online/business metric (CTR, revenue, retention) and name the gap between them. Always propose an A/B test.
Where does training data come from? Implicit (clicks) vs explicit (ratings) labels, class imbalance, label delay, and feedback loops / position bias.
User, item, context, and cross features. Discuss freshness (batch vs real-time), an embedding/feature store, and avoiding train-serve skew.
Start simple (logistic regression / GBT baseline), then justify going deeper (two-tower, DLRM, transformer). State the latency/accuracy trade-off explicitly.
Candidate generation vs ranking, ANN retrieval, caching, batching, and a heavy-then-light multi-stage funnel. Cover cold start and fallbacks.
Drift detection, guardrail metrics, online evaluation, retraining cadence, and failure modes. End by listing what you'd build next.