Data dashboards, code, and cloud pipeline visual

Data Science | MLOps | Backend Systems

Sagandeep Kaur

IIT Madras data science student building data products, ML systems, and production-oriented applications with clean APIs, reliable pipelines, and measurable model behavior.

Profile

Data science foundation with production engineering habits

I work across data analysis, backend systems, ML workflows, and deployment pipelines. My projects combine modeling with practical engineering: APIs, validation, automation, observability, and interfaces that make model outputs usable.

9.04

IIT Madras CGPA

9.5

Project CGPA

7.5

IELTS

Experience

Applied engineering and teaching work

Recent roles across web development, NLP, student support, and event operations.

Web Development Intern

NPTEL, IIT Madras

Sep 2025 - Jun 2026

  • Developed and optimized SQL queries for large learner datasets, improving reporting efficiency.
  • Built Flask APIs that enabled structured data access for analytics workflows.
  • Automated data pipelines using PHP and SQL, reducing manual reporting effort.
  • Supported data workflows with validation checks to improve data quality.

NLP Intern

Department of Management Studies, IIT Madras

Sep 2025 - Dec 2025

  • Analyzed system data patterns and generated insights through visualization.
  • Implemented topic modeling and interpretability techniques for unstructured data.
  • Performed preprocessing to structure raw text data for downstream analysis.
  • Improved usability of model outputs through structured interpretation.

Teaching Assistant

IIT Madras

May 2024 - Sep 2024 | May 2025 - Sep 2025 | Jan 2026 - May 2026

  • Explained Python programming and software engineering concepts to students.
  • Guided students through assignments, problem solving, and debugging workflows.
  • Simplified complex topics for better learning outcomes.

Event Deputy Head

Research Summit 24 (Paradox), IIT Madras

2024

  • Coordinated event operations and managed cross-functional collaboration.

Technical Skills

Tools used across data, ML, and deployment

Programming and CS

PythonJavaSQLData StructuresOOPProblem Solving

Backend and Systems

FlaskFastAPIREST APIsAuthenticationAsync TasksDockerKubernetes (GKE)GitHub ActionsOpenTelemetry

Data and ML

Data CleaningEDAFeature EngineeringRegressionDecision TreesRandom ForestSVMSHAPFairlearn

MLOps and Cloud

DVCMLflowMLOpsLLMOpsMLSecOpsVertex AIGCS

Tools and Visualization

MatplotlibSeabornExcelGitJiraSQLite

Projects

Selected systems and applied ML work

A mix of application engineering, model development, data analysis, MLOps, LLMOps, and computer vision projects.

QC-Leap

AI-Powered Manufacturing Quality Control System

GitHub

Next.js, TypeScript, Tailwind CSS

  • Developed application workflows for automated defect detection using computer vision.
  • Focused on integrating ML models into a usable production-style interface.

System Threat Forecaster

System vulnerability classification

GitHub

Python, Scikit-learn, Pandas, Matplotlib

  • Built a classification model to predict system vulnerabilities.
  • Implemented preprocessing, evaluation, and model performance analysis.

Household Services Platform

Role-based services marketplace

GitHub

Vue.js, JWT, Celery, Redis, Mailhog

  • Built a full-stack application with admin, customer, and professional roles.
  • Developed APIs, authentication, async task handling, and workflow automation.

Influencer Marketing App

Brand and influencer collaboration backend

GitHub

Flask, SQLite, Python

  • Designed database schema and REST APIs for campaign and user data handling.
  • Built backend flows to connect brands with influencers.

Business Data Analysis

Local dairy shop sales analysis

GitHub

Python, Pandas, Excel, Matplotlib, Seaborn, SQLite, Flask

  • Analyzed sales and customer data to identify trends and improve inventory decisions.
  • Created visualizations to communicate insights clearly.

Multilingual Speech Recognition

Speech-to-text pipeline

GitHub

Whisper, Faster-Whisper, Indic tools

  • Built multilingual audio transcription workflows with optimized inference.
  • Handled environment setup and dependency management for model execution.

Multilingual Sentiment Analysis

LLM-based sentiment classification

GitHub

Gemma, Prompt Engineering, Python

  • Leveraged Gemma for sentiment classification across multilingual datasets.
  • Designed prompts and evaluated model outputs for consistency and accuracy.

Denoising and 4x Super-Resolution

Low-light image restoration

GitHub

Python, U-Net, PixelShuffle, TTA

  • Built an end-to-end restoration pipeline for IIT Madras DLP 2026 NPPE3 on RELLISUR.
  • Reduced competition RMSE from 41.8 to 18.7 using SRUNet, multi-task loss, and calibration.
  • Used EDA to model non-Gaussian noise and improve validation across lighting conditions.

End-to-End MLOps Pipeline

Cloud-native ML deployment and observability

GitHub

DVC, MLflow, FastAPI, Docker, GKE, GitHub Actions, OpenTelemetry

  • Orchestrated data versioning, experiment tracking, model registry, and deployment on GCP.
  • Automated CI/CD for a Dockerized FastAPI service on GKE with HPA and load testing.
  • Integrated MLSecOps, drift monitoring, fairness assessment, SHAP, and LLM guardrails.

RAG Document QnA

Local retrieval-augmented document assistant

GitHub

Python, FAISS, Streamlit, Gemini

  • Engineered a local RAG system for grounded answers from proprietary documents.
  • Optimized CPU-bound semantic search with FAISS flat L2 indexing.
  • Built a Streamlit dashboard with PDF ingestion, session state, and constrained prompts.

Education

Academic background

Indian Institute of Technology, Madras

B.S. Data Science and Programming

2023 - 2027 | CGPA: 9.04 | Project CGPA: 9.5

CBSE Class 12

Senior secondary education

2022 | 80.2%

CBSE Class 10

Secondary education

2020 | 92%

Contact

Open to data science, backend, and ML engineering opportunities.

Reach out for internships, collaborations, or projects involving analytics, applied ML, backend systems, and deployment workflows.