Web version

Ananya Deshpande

Data Scientist (Entry-Level)

Pune, India • +91 98220 51764 • ananya.deshpande.ds@gmail.com • linkedin.com/in/ananya-deshpande-ds • github.com/ananyadesh

Portfolio: www.eliorexa.com/portfolio/data-scientist-fresher

Professional Summary

Final-year Statistics & Computer Science graduate who turns messy data into clear decisions using Python, R and SQL. Comfortable across the workflow — exploratory analysis in pandas, hypothesis testing, machine-learning models in scikit-learn, and crisp data visualization in Tableau. Analytically curious and rigorous about statistics, demonstrated across a forecasting capstone, a churn-prediction project and a Kaggle medal.

Skills

Languages: Python, R, SQL, Bash

ML & stats: scikit-learn, statistics, machine learning, hypothesis testing, A/B testing, regression, time-series

Data wrangling: pandas, NumPy, dplyr, Jupyter, feature engineering

Visualization & BI: Tableau, matplotlib, seaborn, ggplot2, Power BI

Core Competencies

Python · R · SQL · statistics · machine learning · pandas · scikit-learn · data visualization · A/B testing · Tableau · Data Scientist

Work Experience

Data Science InternLumio Analytics

May 2025Aug 2025

Remote

  • Built a churn-prediction model in scikit-learn (gradient boosting) reaching 0.88 ROC-AUC, flagging at-risk accounts 3 weeks earlier than the existing rules engine.
  • Automated a weekly KPI pipeline in Python + SQL that replaced a manual spreadsheet, saving the analytics team ~6 hours/week.
  • Ran an A/B test on two onboarding emails (n = 18k), establishing statistical significance (p = 0.013) and a 9% lift in activation.
  • Designed 4 Tableau dashboards for the growth team, cutting recurring ad-hoc data requests roughly 40%.

Projects

DemandCast — retail demand forecasting (capstone) — MAPE 19% → 11%

Time-series forecasting system for a mid-size grocery chain, built as the degree capstone.

  • Engineered features from 3 years of sales data in pandas and benchmarked SARIMA against gradient-boosted trees, cutting forecast MAPE from 19% to 11%.
  • Shipped an interactive Tableau dashboard so category managers could explore forecasts by store and SKU.

Tech: Python, time-series, scikit-learn, Tableau

FraudLens — transaction anomaly detection — 0.91 fraud recall

Imbalanced-classification study on a public payments dataset.

  • Handled severe class imbalance with SMOTE and cost-sensitive learning, lifting recall on fraud cases to 0.91 at fixed precision.
  • Documented the statistical evaluation (precision-recall, calibration) in a reproducible R Markdown report.

Tech: Python, R, machine learning, statistics

Education

B.Sc. (Hons.) Statistics & Computer Science

2026

Fergusson College (Savitribai Phule Pune University), Pune

CGPA 9.1/10 · Coursework: Probability, Inference, Regression, Machine Learning, Databases, Data Visualization

Certifications

  • IBM Data Science (Professional Certificate)Coursera (2025)
  • Tableau Desktop SpecialistTableau (2025)