Abhishek Kumar Maurya abhishek2f24

Abhishek Kumar Maurya

Senior Data Engineer · Azure · Databricks · Kafka · Spark · Snowflake

About Me

I architect and deliver cloud-native data platforms that move real data at real scale — 5TB/day ETL, 1M IoT events/hour, 5M+ GPS records in production. Currently at GHD (UK), building geospatial and streaming pipelines for global infrastructure clients.

5 years designing end-to-end data platforms (ingestion → transformation → orchestration → serving)
Triple cloud certified — Microsoft Azure · AWS · Google Cloud
Expertise in real-time streaming (Kafka, Spark Streaming, Delta Lake) and modern data stack (dbt, Snowflake, Airflow, Databricks)
Open to fully remote roles across US, UK, Australia, and Europe

Tech Stack

Cloud

Processing & Streaming

Warehousing & Transformation

Orchestration

Languages

DevOps & IaC

BI & Visualization

Production Impact

Metric	Value
ETL throughput	5 TB/day
Streaming throughput	~1M events/hour
GPS records processed	5M+
Pipeline latency reduced	40%
Storage costs reduced	30%
Manual audit effort eliminated	65%
Securable objects validated	500+
Cloud certifications	3 (Azure + AWS + GCP)

Featured Projects

Real-Time Streaming Pipeline — Kafka to Delta Lake

End-to-end streaming data platform: Kafka ingestion → PySpark Structured Streaming → Delta Lake → Snowflake → dbt models → Airflow orchestration. Production-grade with CI/CD, data quality checks, and monitoring.
Kafka PySpark Delta Lake Snowflake dbt Airflow Docker GitHub Actions

Cloud Lakehouse on Azure — Medallion Architecture

Medallion architecture (Bronze/Silver/Gold) on Azure Data Lake Storage + Databricks + Delta Lake + ADF. Includes Unity Catalog governance, Great Expectations data quality, and Power BI serving layer.
Azure Databricks Delta Lake ADF dbt Unity Catalog Power BI

Geospatial Fleet Analytics Platform

Processes 5M+ GPS records for mining fleet operations: haversine distance, raster elevation, DBSCAN clustering for EV station placement, and interactive Leaflet.js stakeholder dashboard.
PySpark Databricks GeoPandas DBSCAN Leaflet.js Delta Lake

Certifications

Microsoft Certified: Azure Data Engineer Associate
AWS Certified Data Engineer
Google Cloud Professional Data Engineer

Currently at

GHD — Data Engineer (Remote, UK-based client)
Building geospatial data platforms for global infrastructure and mining clients.

Open to Senior Data Engineer / Data Platform Engineer roles — Remote · US · UK · Australia · Europe
abhishek2f24@gmail.com

Provide feedback

Saved searches

Use saved searches to filter your results more quickly