Skip to content
View Krunal96369's full-sized avatar
πŸ’™
πŸ’™

Highlights

  • Pro

Block or report Krunal96369

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Krunal96369/README.md

πŸ‘‹ Hey, I'm Krunal Chauhan

πŸš€ Software Engineer (Data/Platform) β€’ Real-time Systems β€’ Cloud-Native

I design and ship reliable systems: data pipelines, streaming services, and APIs that turn messy inputs into clean, fast, observable products. 3+ years in the trenches, currently finishing a Master’s in Software Engineering @ UHCL. I care about smart abstractions, tests that matter, and shipping code that’s easy to run at 3 a.m.

  • πŸ”­ Building: event-driven backends, real-time analytics, resilient pipelines
  • πŸ§ͺ Loop: model β†’ automate β†’ deploy β†’ measure
  • ☁️ Clouds: AWS (S3, EMR, Glue, Lambda, Redshift), GCP
  • πŸ“ Houston-ish β€’ Remote-friendly

🧰 Tech Toolbox

Languages: Python, SQL, Java, JavaScript, Typescript
Backend & APIs: FastAPI, Node.js, REST
Streaming & Compute: Kafka, Apache Spark, AWS Glue/EMR/Lambda
Data Stores: PostgreSQL, Redshift, DynamoDB, MongoDB, Firebase
Modeling & Orchestration: dbt, Apache Airflow
DevOps: Docker, GitHub Actions, Prometheus, Grafana
Frontend (enough to ship): React.js
Testing: PyTest, Postman, smoke tests, contract tests


πŸ“ˆ Featured Projects

πŸ•΅οΈβ€β™‚οΈ Fraud Detection β€” Isolation Forest (Unsupervised)

Repo: fraud_detection
When: May 2025

Automated anomaly detection pipeline using Isolation Forest (PyOD) for financial transactions β€” no labels needed.

  • Real-time scoring with configurable contamination rate
  • Visual analytics: normal vs. anomalous distributions
  • Clean notebooks and reproducible environment

Stack: Python, Pandas, NumPy, Matplotlib, Seaborn, PyOD, Jupyter


🧠 ThinkDocs.ai β€” AI Document Intelligence (RAG)

Production Q&A over unstructured PDFs with low-latency retrieval, stable ingestion, and observability.

  • Sub-200ms answers, doc auto-ingest, dashboards and metrics
    Stack: FastAPI, ChromaDB, Prometheus, Grafana

πŸ€ NBA Analytics Pipeline

ETL β†’ normalized DB β†’ executive dashboards. Computes PER, TS%, and Four Factors with rate-limits and retries.

Stack: Python, SQL, Power BI


πŸ₯ Clinic Ops Platform

Full-stack patient management with RBAC/JWT and real-time alerts. Reduced scheduling time by ~45%.

Stack: React, Node, Postgres, Socket.io, AWS SNS


πŸŽ“ Advising Data Warehouse (UHCL)

Automations across Google Sheets/CRM. Standardized inputs led to ~30% fewer errors and ~40% faster updates.


πŸ’³ Pay + Maps SDK

React + MUI front ends integrating Stripe/Razorpay and Google Maps. Reporting scripts turned hours into minutes.


πŸ§ͺ How I build

  • Event-first modeling: design streams and schemas before code
  • Contracts & tests: JSON schema + smoke + contract tests
  • Guardrails: idempotent jobs, checkpoints, safe backfills
  • Observability: metrics over vibes; useful dashboards and alerts

πŸ“š Currently learning

  • Event-driven system design (sagas, outbox, backpressure)
  • Cost-aware modeling on Redshift / column stores
  • Better DX for dbt + Airflow in mono-repos

🌐 Connect

Pinned Loading

  1. fraud_detection fraud_detection Public

    Jupyter Notebook

  2. NBA_Dashboard NBA_Dashboard Public

    Python

  3. thinkdocs thinkdocs Public

    ThinkDocs is a full-stack AI system that transforms unstructured documents (PDFs, emails, reports) into intelligent, queryable knowledge bases.

    Python