Netflix / Spotify viewing-history dashboard
Beginner
Analyze your own exported streaming history to find your top genres, binge patterns, and time-of-day habits.
- Dataset: Your own Netflix/Spotify data export (request from account settings).
- Stack: Pandas, Plotly, Streamlit
- Stands out: Use YOUR data — recruiters remember personal, story-driven projects over Titanic for the 1000th time.
COVID / public-health trend explorer
Beginner
Build an interactive dashboard of case/vaccination trends with rolling averages and per-capita normalization.
- Dataset: Our World in Data, WHO open datasets.
- Stack: Pandas, Plotly Dash
- Stands out: Add correct per-capita and 7-day smoothing — most beginner dashboards show misleading raw counts.
E-commerce sales & cohort analysis
Intermediate
Compute RFM segments, monthly cohort retention, and revenue drivers from transaction data.
- Dataset: UCI Online Retail dataset, Kaggle e-commerce sets.
- Stack: Pandas, SQL, Seaborn
- Stands out: Show a cohort-retention heatmap — it signals real product-analytics thinking.
City Airbnb pricing analysis
Intermediate
Explore what drives listing price (location, reviews, room type) and map it geospatially.
- Dataset: Inside Airbnb (free city datasets).
- Stack: Pandas, GeoPandas, Folium
- Stands out: Geospatial maps + a clear written 'so what' section beat raw charts.
Personal finance / expense analyzer
Beginner
Parse bank/credit-card CSV exports, auto-categorize spend, and forecast monthly burn.
- Dataset: Your own anonymized bank statements.
- Stack: Pandas, Plotly
- Stands out: A rules+keyword categorizer plus a simple forecast shows end-to-end product thinking.
Sports analytics (cricket/football) dashboard
Intermediate
Analyze player/team performance, win probability, and momentum shifts.
- Dataset: Cricsheet, StatsBomb open data, football-data.co.uk.
- Stack: Pandas, Plotly, Streamlit
- Stands out: A win-probability model on top of raw stats turns a dashboard into a story.
World happiness / development indicators
Beginner
Correlate GDP, health, and freedom with happiness scores across countries and years.
- Dataset: World Happiness Report, World Bank Open Data.
- Stack: Pandas, Seaborn
- Stands out: Distinguish correlation from causation explicitly — interviewers probe this.
Stock / crypto market exploratory analysis
Intermediate
Analyze volatility, correlations, and drawdowns across assets with rolling metrics.
- Dataset: yfinance API, CoinGecko API.
- Stack: Pandas, yfinance, Plotly
- Stands out: Avoid the 'I predicted the price' trap — focus on honest risk/volatility analysis.
A/B test analysis toolkit
Intermediate
Build a reusable notebook/app that runs significance tests, power analysis, and CUPED variance reduction.
- Dataset: Simulated or Kaggle A/B test datasets.
- Stack: SciPy, statsmodels, Streamlit
- Stands out: Demonstrating statistical rigor (power, p-hacking guards) is rare and impressive.
Survey / NPS text + score analysis
Beginner
Combine quantitative NPS with theme extraction from open-ended responses.
- Dataset: Public survey datasets, Kaggle.
- Stack: Pandas, simple NLP
- Stands out: Mixing numeric and text analysis shows breadth.
Energy consumption & forecasting EDA
Intermediate
Explore seasonality and build a baseline forecast of household/grid energy use.
- Dataset: UCI Individual Household Electric Power Consumption.
- Stack: Pandas, statsmodels, Prophet
- Stands out: Proper time-series decomposition (trend/seasonality) before modeling.