Meghana Reddy
Senior Machine Learning Engineer
Hyderabad, India • +91 98495 60277 • meghana.reddy@hey.com • linkedin.com/in/meghana-reddy-ml • github.com/meghanareddy
Portfolio: www.eliorexa.com/portfolio/ai-ml-engineer-mid
Professional Summary
Machine learning engineer with 5 years taking models from notebook to production in e-commerce and fintech. Owns the full lifecycle — data pipelines, PyTorch/TensorFlow training, model deployment and MLOps — and tracks success in the numbers that matter: accuracy, latency, cost and revenue. Shipped LLM and NLP systems serving millions of requests, built the MLOps backbone teams rely on, and mentors junior engineers on experiment discipline.
Skills
ML & deep learning: PyTorch, TensorFlow, scikit-learn, Hugging Face, XGBoost, LightGBM
LLMs & NLP: LLM fine-tuning, RAG, Embeddings & vector search, LangChain, Named-entity recognition, Evaluation harnesses
Data & MLOps: Airflow, Spark, Feature stores (Feast), MLflow, Kubeflow, Docker, Kubernetes
Cloud & serving: AWS (SageMaker, S3, Lambda), GCP (Vertex AI, BigQuery), Triton, FastAPI, Terraform, Datadog
Core Competencies
Python · PyTorch · TensorFlow · scikit-learn · LLMs · NLP · MLOps · data pipelines · model deployment · AWS · GCP · recommender systems · mentoring
Work Experience
Machine Learning Engineer — Kartmint (Series C e-commerce)
Aug 2022 – Present
Hyderabad
- Own the product-recommendation system (PyTorch two-tower model) serving 9M users; lifted click-through 22% and add-to-cart 11% over the prior heuristics.
- Built a real-time inference service on AWS SageMaker + Triton holding p99 latency under 80ms at 4,000 requests/second during peak sale traffic.
- Designed the Airflow + Spark feature pipeline feeding a Feast feature store, cutting feature-freshness lag from 24 hours to 15 minutes.
- Shipped an LLM-powered support-triage classifier (fine-tuned transformer) that auto-routes 68% of tickets, saving the CX team ~₹40L/year in handling cost.
- Standardized MLOps with MLflow and Kubeflow pipelines, cutting model-to-production time from 3 weeks to 4 days and mentoring 2 junior engineers on it.
Data Scientist — PaySprout
Jul 2020 – Jul 2022
Bengaluru
- Built a fraud-detection model (XGBoost + scikit-learn) on 12M transactions that cut fraud losses 31% while holding false positives under 0.4%.
- Deployed batch and streaming scoring on GCP Vertex AI with automated drift monitoring, catching 3 silent data-quality regressions before they hit revenue.
- Reduced training cost ~₹5.5L/year by moving feature computation to BigQuery and right-sizing GPU jobs.
Projects
EvalForge — open-source LLM eval harness — 5 features gated
A pytest-style evaluation framework for LLM and RAG pipelines.
- Authored a configurable harness scoring relevance, faithfulness and latency; adopted internally to gate 5 LLM features before release.
Tech: LLMs, Python, Evaluation
Interactive 3D Portfolio
WebGL portfolio built on Eliorexa, linked from this résumé.
- Reactive three.js hero, scroll-driven case studies, lazy-loaded and reduced-motion safe — LCP 1.2s, CLS 0.02 on mobile.
Tech: three.js, React, Performance
Education
M.Tech Computer Science (Machine Learning)
2020
IIT Hyderabad, Hyderabad
Thesis: low-latency neural ranking for recommender systems
Certifications
- AWS Certified Machine Learning – Specialty — Amazon Web Services (2023)
- Google Cloud Professional ML Engineer — Google Cloud (2022)