Great score on Implement LRU Cache Algorithm, Difficulty Medium
Great score on Redesigning an ETL Pipeline for Enhanced Performance, Difficulty Medium
Great score on Optimize Data Storage Strategies for Hybrid Cloud Environments, Difficulty Hard
Great score on Reverse Digits of a 32-bit Integer, Difficulty Easy
Great score on Word Search in Grid, Difficulty Medium
Great score on Validate Parentheses in a String, Difficulty Hard
Architected and deployed a 10-node Apache Spark Standalone cluster with high availability
Provisioned and configured 10 Linux production servers for a MapR-based data lake environment
Deployed and configured Dremio on an on-premises Kubernetes cluster supporting 700+ users and 80TB of enterprise data, enabling scalable, high-availability analytics.
Converted 10+ legacy SSIS pipelines into automated Apache Spark workflows orchestrated with Apache Airflow, reducing processing time by 45% and eliminating manual intervention.
Setup and secured Apache Airflow with Azure SSO, enabling centralized identity management.
Developed highly accurate forecasting models with Azure ML
Developed and implemented Object-Oriented and functional programming concepts
Collaborated on the development of BrachOps during the Microsoft Global Hackathon
Developed a SaaS Customer Lifetime Value (CLV) metric, enabling data-driven customer retention and revenue optimization.
Gathered, cleaned, and maintained large datasets from various flight records
Conducted detailed data analysis to identify trends, patterns, and insights
Partnered with cross-functional teams to develop data-driven strategies
An end-to-end data engineering pipeline that extracts name, symbol, and stock data for multiple companies using yfinance, stores the raw data in Azure Data Lake, transforms it with PySpark, and loads it into Azure SQL Server.
Implemented an Xception model (ImageNet) for clothing classification (e.g., pants, hats, shirts) using TensorFlow Serving, gRPC, protobuf, Docker, and Kubernetes.