- Mangalore, Karnataka, India
- https://gagan-km.github.io/
- in/gagan-k-m-a0580b285
- @gagankm89
Lists (1)
Sort Name ascending (A-Z)
Stars
DigitalPlat FreeDomain: Free Domain For Everyone
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼
PlotLLM is an AI-powered Matplotlib plot generator built with Streamlit. Describe the plot you need in natural language, and a LLM reasoning model writes the Python code for you. Run it locally wit…
This repository implements a real-time credit card fraud detection pipeline using Kafka, Spark and Cassandra. Kafka continuously produces credit card transactions that will be analyzed by the Spark…
This is a repo with links to everything you'd ever want to learn about data engineering
Realtime data pipeline using Kafka + Spark + AWS S3 (Terraform) + Snowflake
Scrape tech articles, transform, do sentiment analysis, and push to a MongoDB Atlas database, build an interactive dashboard with Streamlit to be hosted on its community cloud and automated with Gi…
Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters.
Apache Kafka - A distributed event streaming platform
Azure Data Engineer Project
Design data models, build data warehouses and data lakes, automate data pipelines, and work with massive datasets
In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data from the Spotify API, transform into desired format and load it…
Big Data Engineering Course and project work.
Repository containing projects and summaries of my studies in the field of Data Engineering.
Example end to end data engineering project.
Personal Data Engineering Projects
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache Superset is a Data Visualization and Data Exploration Platform
Learn how to develop, deploy and iterate on production-grade ML applications.
AWS-native chatbot using Bedrock
This repository helps you learn Python and Machine Learning from scratch.