Senior Data Engineer specializing in AI-powered Data Platforms, Streaming Systems, and Cloud-native Architectures.
Senior Data Engineer with 15+ years of experience designing and building scalable data platforms across the Azure and AWS ecosystems. Expertise in PySpark, Databricks, Kafka, and modern data architectures, including streaming and API-driven systems.
Recently focused on building data APIs, automation systems, and AI-ready pipelines, with hands-on experience in FastAPI, serverless architectures, and real-time data processing. Strong background in transforming legacy data systems into scalable, cloud-native solutions.
AI-powered automation systems • Real-time streaming pipelines • Cloud-native data platforms • API-driven enterprise solutions • Intelligent workflow orchestration • Scalable ETL and analytics systems
• Generative AI & Agentic AI Systems
• LLM-powered enterprise workflows
• FastAPI & AI backend systems
• AI observability & automation
• Retrieval-Augmented Generation (RAG)
Python, FastAPI, Core Java, Shell Scripting
PySpark, Spark Streaming, Kafka, Hive, SQL
Azure (ADF, Databricks, ADLS Gen2)
Data Lakes, ETL/ELT Pipelines, Streaming Pipelines, Data Modelling
• 15+ Years Experience
• 5+ Years Azure & AWS
• 10+ Enterprise Projects
• Real-time Streaming Systems
• Production-grade APIs
• PySpark & Kafka Streaming
• Azure Databricks
• FastAPI Microservices
• AI-ready Data Architectures
• Enterprise Automation
IBM India • 2021 - Present
A scalable chatbot platform leveraging LLMs for enterprise support.