ingestion
Here are 148 public repositories matching this topic...
Transform any codebase or tech stack in Visual Studio to prompt-friendly text for LLMs!
Updated Apr 8, 2025 - JavaScript
📊 Fetch free historical candlestick data from the Blofin Exchange API without API keys, saving years of data to CSV for analysis and research.
Updated Nov 12, 2025 - Python
Simple Python interface for the Fivetran API. Powered by HTTPX.
Updated Jun 5, 2024 - Python
This project implements a full-stack data engineering solution that connects to the Spotify Web API to extract a user’s recently played tracks, stores the data in a PostgreSQL database, applies transformations using dbt, and delivers actionable insights via Metabase dashboards.
Updated Apr 7, 2025 - Python
The dbsched bundle is preconfigured with the Pado scheduler to periodically execute jobs that dump database tables to CSV files from which it automatically extracts column information to generate the corresponding VersionedPortable classes. It then transforms the CSV records to objects using the generated classes before ingesting them into Hazel…
Updated Oct 14, 2025 - Shell
Lab n°2 of "Applications of Big-Data" @ Efrei Paris
Updated Dec 13, 2020 - Jupyter Notebook
Reliable, production-style data pipeline that ingests data from multiple sources (API, web scraper, CSV), cleans & standardizes with Pandas/NumPy, loads into SQLite, and surfaces insights through a Streamlit dashboard.
Updated Oct 19, 2025 - Python
Data ingestor that reads and parses executive orders from Wikisource
Updated Apr 2, 2017 - Python
A data pipeline management platform
Updated Dec 9, 2022 - JavaScript
SciCat Data Ingestion with TypeScript 📥✨ This repository provides a TypeScript-based tool for importing and ingesting data into SciCat, the science data catalog used at the European Spallation Source (ESS). Features ✨ Data Ingestion: Automates data import into SciCat. TypeScript Implementation: Ensures ty…
Updated Jan 1, 2025 - TypeScript
This project ingests YouTube video data related to fishing, stores it in MongoDB, and provides visualizations through Metabase for analysis.
Updated Dec 6, 2024 - Python
Read-only mirror of https://gitlab.com/sorcero/community/ingestum
Updated Jan 23, 2023 - Python
A resilient, prefix-sharded ingestion pipeline for large static breach dumps (e.g., AntiPublic), optimized for low-resource environments (e.g., Raspberry Pi + NAS/SSD).
Updated Jun 13, 2025 - JavaScript
Ingest data fetched from Snowflake into Elasticsearch
Updated Oct 27, 2020 - Python
This repository is dedicated to learning LangChain by creating a generative AI application. This web application uses Pinecone as a vector store to answer questions related to LangChain, utilizing sources from the official LangChain documentation.
Updated Jul 30, 2024 - Python