Skip to content
View bhatiaarjun19's full-sized avatar

Block or report bhatiaarjun19

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 39,041 7,490 Updated Dec 15, 2025

Data Engineering, SQL, Exploratory Data Analysis (EDA), Machine Learning (Python), Business Intelligence (BI)

Jupyter Notebook 14 1 Updated Jul 4, 2024

LLM Zoomcamp - a free online course about real-life applications of LLMs. In 10 weeks you will learn how to build an AI system that answers questions about your knowledge base.

Jupyter Notebook 4,478 810 Updated Dec 1, 2025

👨🏻‍💻 Personal profile+blog created using Gatsby, React & GraphQL.

SCSS 3 2 Updated Mar 20, 2024

💻 Automated personal dotfiles for macOS workspace/settings. One-click install & setup.

Shell 4 Updated Nov 18, 2025

My "junkyard" homelab build with a k3s cluster running on a bunch of Raspberry Pi 4Bs, flash drives and HDD. Running on a budget.

7 1 Updated Oct 19, 2024

LLM-powered document chat using Amazon Bedrock and AWS Serverless

TypeScript 293 272 Updated Nov 24, 2025

My second attempt at the Data Engineering Zoomcamp project

R 5 1 Updated Apr 19, 2024
Jupyter Notebook 2 Updated Apr 16, 2024

Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼

Jupyter Notebook 33,996 7,203 Updated Dec 19, 2025

The code in this repository encompasses a real-time election voting system constructed with Python, Kafka, Spark Streaming, Postgres, and Streamlit. The implementation leverages Docker Compose to e…

Python 5 5 Updated Jan 26, 2024

RAG enabled Chatbots using LangChain and Databutton

Python 164 51 Updated Nov 6, 2023

Project video on my Youtube channel about building an audio content analyzer dashboard.

Jupyter Notebook 23 16 Updated Feb 22, 2023

Helm chart to deploy a highly reliable and available Kafka cluster with Postgresql on a kubernetes cluster

Smarty 2 1 Updated Dec 9, 2023

A curated list of awesome dbt resources

1,606 156 Updated Oct 22, 2025

RESTful API with strongly-typed JSON schema for distributed systems. Uses Elasticsearch-Kibana for indexed searching and RabbitMQ as a message broker.

JavaScript 3 Updated Dec 2, 2023

Production-ready REST API with Docker, Hashicorp Packer, custom error handlers and ES7+ support

JavaScript 4 1 Updated Jan 12, 2024

Infrastructure as Code with Hashicorp Terraform for Google Cloud to provision a managed kubernetes cluster (GKE)

HCL 2 2 Updated Dec 8, 2023

Beginner data engineering project - batch edition

HTML 554 190 Updated Jan 22, 2025

Data Engineering Practice Problems

Python 2,440 766 Updated Jan 8, 2025

Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more

HCL 370 83 Updated Nov 28, 2023

Data Engineering with Python, published by Packt

Python 770 309 Updated Jan 30, 2023

Personal Data Engineering Projects

Jupyter Notebook 971 211 Updated Feb 8, 2023
Python 2 Updated Mar 20, 2020

Python code written during my MsBAN (Masters of Science in Business Analytics)

Jupyter Notebook 2 Updated Dec 19, 2023

🧩 Amazon SES, AWS Lambda & DynamoDB

JavaScript 2 1 Updated Jan 3, 2023

🏭 Infrastructure as Code: AWS Cloudformation to create VPCs to launch EC2 instances using a EC2 Launch Template

1 1 Updated Jan 4, 2023

🤖 RESTful backend API service using express and NodeJS

JavaScript 1 1 Updated Jan 18, 2023