Hello, I'm

Kranthi
Vanukuru

Data Scientist | ML Engineer

I turn data into insights and build intelligent models that drive real-world impact using AI, cloud, and scalable data engineering.

10M+

Records Processed

25+

Dashboards Built

15+

End-to-End Projects

30%

Cost Optimization

Profile

About Me

Curious mind.
Data driven.

I’m a Data Scientist with a strong foundation in statistics, machine learning, and data engineering. I enjoy solving complex problems, uncovering patterns, and building data-driven solutions that create real business value.

📍 Columbia, SC | 💼 Data Scientist | ☁️ Cloud & ML | ⚙️ Data Engineering

2+

Years Experience

AWS / GCP

Cloud & ML Engineering

PySpark

Big Data Processing

AI/ML

Predictive Analytics

My Tech Stack

Tools & Technologies
I Work With

Languages

Python

SQL

R

Data Science / ML

Pandas

TensorFlow

scikit-learn

PyTorch

XGBoost

Jupyter

Data Engineering

PySpark

Kafka

Airflow

Iceberg

Hive

ETL

AWS / Cloud

S3

SageMaker

Athena

Lambda

CloudWatch

Step Functions

Visualization / BI

Tableau

Power BI

Plotly

Excel

Seaborn

Matplotlib

Featured Projects

Selected Work

Data Engineering

FEPDO Reconciliation Automation

Automated FEP data reconciliation using PySpark, Glue, and Athena. Reduced processing time and improved data accuracy.

PySpark | Glue | Athena | S3

Data Lake

Iceberg Optimization Pipeline

Optimized Iceberg tables with partitioning and compaction strategies for faster analytics queries.

Iceberg | AWS Glue | Athena

Cloud / Observability

Step Function Monitoring Dashboard

Built monitoring dashboards for AWS Step Functions using CloudWatch, Athena, and reporting tables.

CloudWatch | Athena | AWS

Healthcare Analytics

Lightbeam Data Pipelines

Developed scalable eligibility and patient attribute pipelines for healthcare analytics and reporting.

AWS Glue | Kafka | Redshift

Professional Experience

Career Journey

DRDO

Software Engineer Intern

2021

Worked on software development and automation initiatives, contributing to research-oriented engineering workflows and backend system enhancements.

Java | Backend | Automation

National Exchange Carrier Association

Data Engineer

2022 - 2023

Built scalable ETL pipelines, optimized data workflows, and supported cloud-based analytics initiatives using big data technologies and AWS services.

PySpark | AWS | ETL

BlueCross BlueShield

Data Scientist

2023 - Present

Leading cloud-native analytics and machine learning initiatives, building scalable healthcare data pipelines, reconciliation systems, and intelligent reporting solutions.

Machine Learning | AWS Glue | Healthcare Analytics

Certifications & Achievements

Continuous Learning

☁️

AWS Cloud

Expertise in building scalable cloud-native analytics and ETL solutions using AWS services.

Cloud Engineering

🧠

Machine Learning

Applied ML concepts for predictive analytics, intelligent automation, and data-driven insights.

AI / ML

📊

Data Engineering

Built scalable big data pipelines using PySpark, Kafka, AWS Glue, Athena, and Iceberg.

Big Data

🎓

Master’s in AI

Executive Master’s in Artificial Intelligence focused on applied AI, ethics, and real-world systems.

University of the Cumberlands

Let's Connect

Ready to build something
data-driven?

I'm open to data science, machine learning, cloud analytics, and data engineering opportunities.