Skip to content
View sujeetkh's full-sized avatar

Block or report sujeetkh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sujeetkh/README.md

👋 Hi there, I'm Sujeet

I'm a Senior Data Engineer with 8+ years of experience in designing scalable data systems, real-time pipelines, and analytics platforms that power decision-making at scale.


🔧 What I Do

  • 🚀 Build real-time data pipelines with Kafka, Flink, and Beam
  • 🧱 Design robust data models and analytics layers using dbt, BigQuery, and Snowflake
  • ⚙️ Automate workflows and orchestration with Airflow & Dataform
  • 📊 Enable high-performance reporting with tools like Tableau, Metabase, and Redash
  • 🧠 Focus on observability, metadata, and platform reliability at scale

🛠️ Tech Stack

Data Infra: Kafka · Apache Flink · Apache Beam · Dataflow
Transformation: dbt · Dataform · SQL · Python
Warehouses: BigQuery · Snowflake · Redshift
Orchestration: Airflow · Cloud Composer · Prefect
Dashboards: Metabase · Tableau · Redash
Storage: MongoDB · MySQL · PostgreSQL · GCS · S3
Monitoring: Prometheus · Grafana · OpenMetadata


🌱 Currently Exploring

  • Stateful stream processing (e.g., Flink CEP, timers)
  • Data contract-driven pipelines
  • Metadata and lineage tools (OpenMetadata, DataHub)

📫 Let’s Connect


Thanks for stopping by! Feel free to explore some of my open-source experiments, workflow prototypes, and real-time data tools 🚴‍♂️

Pinned Loading

  1. Algorithms Algorithms Public

    Data Structure and Algorithms codes

    Java

  2. Hadoop Hadoop Public

    Map Reduce code

    Java

  3. JavaCommonCode JavaCommonCode Public

    general purpose java codes

    Java

  4. DataScience DataScience Public

    datascience contest repo

    Jupyter Notebook

  5. Airflow Airflow Public

    airflow pipeline general purpose code

  6. Data-Analytics Data-Analytics Public