A
Available now

I'm Aiman Sarosh,

Senior Data Engineer specializing in AI-powered Data Platforms,
Streaming Systems, and Cloud-native Architectures.

Senior Data Engineer with 15+ years of experience in designing and building scalable data platforms across Azure and AWS ecosystems. Expertise in PySpark, Databricks, Kafka, and modern data architectures including streaming and API-driven systems.
Recently focused on building data APIs, automation systems, and AI-ready pipelines, with hands-on experience in FastAPI, serverless architectures, and real-time data processing. Strong background in transforming legacy data systems into scalable, cloud-native solutions.

What I Build

My Work

AI-powered automation systems • Real-time streaming pipelines • Cloud-native data platforms • API-driven enterprise solutions • Intelligent workflow orchestration • Scalable ETL and analytics systems

Current Focus

Focussing On

• Generative AI & Agentic AI Systems
• LLM-powered enterprise workflows
• FastAPI & AI backend systems
• AI observability & automation
• Retrieval-Augmented Generation (RAG)

Curriculum
Vitae 2024

Download CV

Main Stack

Python, FastAPI, Core Java, Shell Scripting

PySpark, Spark Streaming, Kafka, Hive, SQL

Azure (ADF, Databricks, ADLS Gen2)

Data Lakes, ETL/ELT Pipelines, Streaming Pipelines, Data Modelling

Metrics

Featured Expertise

• 15+ Years Experience
• 5+ Years Azure & AWS
• 10+ Enterprise Projects
• Real-time Streaming Systems
• Production-grade APIs
• PySpark & Kafka Streaming
• Azure Databricks
• FastAPI Microservices
• AI-ready Data Architectures
• Enterprise Automation

Latest Experience

Senior Data Engineer

IBM India • 2021 - Present

Projects

Gen-AI Chatbot Platform

A scalable chatbot platform leveraging LLMs for enterprise support.