AI / Machine Learning Engineer · Mid-level · ~5 yrs

Meghana Reddy

Senior Machine Learning Engineer

Professional summary

Machine learning engineer with 5 years taking models from notebook to production in e-commerce and fintech. Owns the full lifecycle — data pipelines, PyTorch/TensorFlow training, model deployment and MLOps — and tracks success in the numbers that matter: accuracy, latency, cost and revenue. Shipped LLM and NLP systems serving millions of requests, built the MLOps backbone teams rely on, and mentors junior engineers on experiment discipline.

9M

Users served by ML in prod

+22%

Click-through on recommendations

80ms

p99 inference latency

Skills

ML & deep learning

PyTorchTensorFlowscikit-learnHugging FaceXGBoostLightGBM

LLMs & NLP

LLM fine-tuningRAGEmbeddings & vector searchLangChainNamed-entity recognitionEvaluation harnesses

Data & MLOps

AirflowSparkFeature stores (Feast)MLflowKubeflowDockerKubernetes

Cloud & serving

AWS (SageMaker, S3, Lambda)GCP (Vertex AI, BigQuery)TritonFastAPITerraformDatadog

Work experience

Machine Learning Engineer · Kartmint (Series C e-commerce)

Aug 2022Present

Hyderabad

  • Own the product-recommendation system (PyTorch two-tower model) serving 9M users; lifted click-through 22% and add-to-cart 11% over the prior heuristics.
  • Built a real-time inference service on AWS SageMaker + Triton holding p99 latency under 80ms at 4,000 requests/second during peak sale traffic.
  • Designed the Airflow + Spark feature pipeline feeding a Feast feature store, cutting feature-freshness lag from 24 hours to 15 minutes.
  • Shipped an LLM-powered support-triage classifier (fine-tuned transformer) that auto-routes 68% of tickets, saving the CX team ~₹40L/year in handling cost.
  • Standardized MLOps with MLflow and Kubeflow pipelines, cutting model-to-production time from 3 weeks to 4 days and mentoring 2 junior engineers on it.

Data Scientist · PaySprout

Jul 2020Jul 2022

Bengaluru

  • Built a fraud-detection model (XGBoost + scikit-learn) on 12M transactions that cut fraud losses 31% while holding false positives under 0.4%.
  • Deployed batch and streaming scoring on GCP Vertex AI with automated drift monitoring, catching 3 silent data-quality regressions before they hit revenue.
  • Reduced training cost ~₹5.5L/year by moving feature computation to BigQuery and right-sizing GPU jobs.

Featured projects

5 features gated
EvalForge — open-source LLM eval harness

A pytest-style evaluation framework for LLM and RAG pipelines.

  • Authored a configurable harness scoring relevance, faithfulness and latency; adopted internally to gate 5 LLM features before release.
LLMsPythonEvaluation

Education

M.Tech Computer Science (Machine Learning)

2020

IIT Hyderabad, Hyderabad

Thesis: low-latency neural ranking for recommender systems

Certifications

AWS Certified Machine Learning – Specialty

Amazon Web Services · 2023

Google Cloud Professional ML Engineer

Google Cloud · 2022

See the work in 3D

Explore Meghana's interactive WebGL portfolio — projects, skills and a way to get in touch.