Skip to content
View zalihat's full-sized avatar

Block or report zalihat

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼

Jupyter Notebook 38,193 7,693 Updated Feb 8, 2026

Free MLOps course from DataTalks.Club

Jupyter Notebook 14,182 2,852 Updated Dec 1, 2025

Python code for common Machine Learning Algorithms

Jupyter Notebook 4,555 4,796 Updated Jun 5, 2025

Projects & Resources to help you become a better AI Developer.

TypeScript 2,105 415 Updated Sep 15, 2025

Code, Notebooks and Examples from Practical Business Python

Jupyter Notebook 1,998 974 Updated Mar 7, 2023

TensorFlow Recommenders is a library for building recommender system models using TensorFlow.

Python 1,998 297 Updated Jan 23, 2026

Lightweight package to query popular search engines and scrape for result titles, links and descriptions

Python 487 85 Updated May 3, 2024

This repository contains advanced LLM-based chatbots for Q&A using LLM agents, and Retrieval Augmented Generation (RAG) and with different databases. (VectorDB, GraphDB, SQLite, CSV, XLSX, etc.)

Jupyter Notebook 433 252 Updated Dec 24, 2025

An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All comp…

Python 314 138 Updated Feb 14, 2025

Suite of tools containing an in-memory vector datastore and AI proxy

Rust 182 17 Updated Feb 1, 2026

Multiclass image classification using Convolutional Neural Network

Jupyter Notebook 73 31 Updated Aug 11, 2024

YAML based tool for monitoring metrics across multiple hosts

Go 59 4 Updated Jan 9, 2023

Music retrieval CLI and API using rust

Rust 52 2 Updated Apr 21, 2023

A complete real-time Change Data Capture (CDC) pipeline using Apache Flink, MariaDB, and Docker Compose. This project demonstrates how to build a modern streaming analytics system that processes da…

Dockerfile 32 4 Updated Sep 6, 2025

Solved Example Questions of the Book "Data Analytics and Decision Making 4th Edition by Albright, Winston and Zappe" in R

R 20 5 Updated Mar 29, 2018

A collection of all 774 local government areas in Nigeria. All LGAS with State Name, Latitude, Longitude and Wikidata:Identifiers.

PHP 14 10 Updated Jul 27, 2022
Jupyter Notebook 5 9 Updated Jan 23, 2021

Expert knowledge skills for Claude Code to help with specific technologies and tools.

4 Updated Oct 19, 2025

an implementation of an Ingestion framework for lakehouses using dagster, dlthub, slingdata, trino, dremio and minio

Python 4 1 Updated Oct 11, 2024

This is basically a simple web scraping program from Jumia deals --> real estate.

Python 4 Updated Dec 11, 2021

tuberculosis classification in chest radiographs using convolutional neural network and Fastai python library

CSS 4 1 Updated Jun 21, 2022
Jupyter Notebook 4 18 Updated Jan 24, 2021

Simulate PostgreSQL Change Data Capture with realistic inserts, updates, deletes, and optional schema evolution. Ideal for testing CDC pipelines, Debezium connectors, and streaming systems.

Python 3 Updated Apr 25, 2025

This project provisions a modular AWS data pipeline using Terraform. Each AWS service lives in its own directory under infrastructure/services, so you can provision and manage them independently.

Jupyter Notebook 2 1 Updated Oct 17, 2025

A tutorial on how to build a visualization dashboard using dash plotly

Jupyter Notebook 2 Updated Feb 5, 2021

A repository for python for data science resources in jupyter notebooks.

Jupyter Notebook 2 Updated Jan 24, 2023

Modern e-commerce analytics stack: MySQL → S3 → Snowflake → dbt → Dagster. Implements incremental ingestion, SCD handling, data quality checks, and enterprise-grade governance.

Python 1 Updated Nov 28, 2025

Practical examples of Slowly Changing Dimensions (SCD Type 0, 1, and 2) using dbt and MySQL.

1 Updated May 10, 2025
Next