Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase
Updated Nov 11, 2025 - Python
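The URL substitution described in the first entry can be sketched in a few lines of Python. This is only an illustration of the literal 'hub' to 'ingest' replacement in the host name, not the project's own code; the function name `to_ingest_url` and the example repository path are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit


def to_ingest_url(github_url: str) -> str:
    """Replace 'hub' with 'ingest' in the host of a GitHub URL.

    Hypothetical helper: it only rewrites the URL string and does not
    contact any service.
    """
    parts = urlsplit(github_url)
    host = parts.netloc.lower()
    # Only rewrite URLs that actually point at GitHub.
    if host not in ("github.com", "www.github.com"):
        raise ValueError(f"not a GitHub URL: {github_url!r}")
    # Replace the first 'hub' in the host name only, leaving the path intact
    # (a repository path may itself contain the substring 'hub').
    new_host = host.replace("hub", "ingest", 1)
    return urlunsplit(parts._replace(netloc=new_host))


if __name__ == "__main__":
    # Example repository path chosen for illustration only.
    print(to_ingest_url("https://github.com/octocat/Hello-World"))
```

Restricting the replacement to the host avoids mangling repository names that happen to contain "hub".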
IBIS is a workflow-creation engine that abstracts the Hadoop internals of ingesting RDBMS data.
Official NetBox Labs plugin for NetBox for Diode
Python script for ingesting various files into a semantic graph. Supports text, images, C++, Python, Rust, JavaScript, and PDFs.
The aim of this project is to automate data ingestion from flat files such as CSV and GZIP-compressed files into a database such as Postgres. The entire setup is automated with Docker, and multiprocessing is used to speed up ingestion.
Tagbase is a data lifecycle management system for electronic timeseries sensor data. It supports different types of data and works with equipment from various manufacturers.
Open-source Python ETL tool for seamless data movement across PostgreSQL, MySQL, Redshift, BigQuery, S3, GCS, and CSV files, with YAML/JSON-based configuration.
This project aims to predict smartphone prices using a combination of batch and stream processing techniques in a Big Data environment. The architecture follows the Lambda Architecture pattern, providing both real-time and batch processing capabilities to users.
LogFlow is an ETL (Extract, Transform, Load) application specialized in log processing
Ingest data of any format into a PostgreSQL database
A fully interactive tool designed to streamline your GitHub repository prompt generation process and facilitate RAG (Retrieval-Augmented Generation) workflows
Python-based ingestion, SQL, Hadoop, and Bash scripting
Periodically ingest incremental updates (inserts / deletes) into BigQuery using a Cloud Composer / Airflow orchestration workflow
Simple Python interface for the Fivetran API. Powered by HTTPX.
This project implements a full-stack data engineering solution that connects to the Spotify Web API to extract a user’s recently played tracks, stores the data in a PostgreSQL database, applies transformations using dbt, and delivers actionable insights via Metabase dashboards.
Reliable, production-style data pipeline that ingests data from multiple sources (API, web scraper, CSV), cleans & standardizes with Pandas/NumPy, loads into SQLite, and surfaces insights through a Streamlit dashboard.
📊 Fetch free historical candlestick data from the Blofin Exchange API without API keys, saving years of data to CSV for analysis and research.