I'm a Senior Data Engineer with 8+ years of experience in designing scalable data systems, real-time pipelines, and analytics platforms that power decision-making at scale.
- 🚀 Build real-time data pipelines with Kafka, Flink, and Beam
- 🧱 Design robust data models and analytics layers using dbt, BigQuery, and Snowflake
- ⚙️ Automate workflows and orchestration with Airflow & Dataform
- 📊 Enable high-performance reporting with tools like Tableau, Metabase, and Redash
- 🧠 Focus on observability, metadata, and platform reliability at scale
Data Infra: Kafka · Apache Flink · Apache Beam · Dataflow
Transformation: dbt · Dataform · SQL · Python
Warehouses: BigQuery · Snowflake · Redshift
Orchestration: Airflow · Cloud Composer · Prefect
Dashboards: Metabase · Tableau · Redash
Storage: MongoDB · MySQL · PostgreSQL · GCS · S3
Monitoring: Prometheus · Grafana · OpenMetadata
- Stateful stream processing (e.g., Flink CEP, timers)
- Data contract-driven pipelines
- Metadata and lineage tools (OpenMetadata, DataHub)
Thanks for stopping by! Feel free to explore some of my open-source experiments, workflow prototypes, and real-time data tools 🚴♂️