ingestion
Here are 148 public repositories matching this topic...
# SciCat Data Ingestion with TypeScript 📥✨ This repository provides a **TypeScript-based tool** for importing and ingesting data into **SciCat**, the science data catalog used at the **European Spallation Source (ESS)**. --- ## Features ✨ - **Data Ingestion**: Automates data import into SciCat. - **TypeScript Implementation**: Ensures ty
-
Updated
Jan 1, 2025 - TypeScript
This project ingests YouTube video data related to fishing, stores it in MongoDB, and provides visualizations through Metabase for analysis.
-
Updated
Dec 6, 2024 - Python
A resilient, prefix-sharded ingestion pipeline for large static breach dumps (e.g. AntiPublic), optimized for low-resource environments (e.g., Raspberry Pi + NAS/SSD).
-
Updated
Jun 13, 2025 - JavaScript
Lab n°2 of "Applications of Big-Data" @ Efrei Paris
-
Updated
Dec 13, 2020 - Jupyter Notebook
Reliable, production-style data pipeline that ingests data from multiple sources (API, web scraper, CSV), cleans & standardizes with Pandas/NumPy, loads into SQLite, and surfaces insights through a Streamlit dashboard.
-
Updated
Oct 19, 2025 - Python
The dbsched bundle is preconfigured with the Pado scheduler to periodically execute jobs that dump database tables to CSV files from which it automatically extracts column information to generate the corresponding VersionedPortable classes. It then transforms the CSV records to objects using the generated classes before ingesting them into Hazel…
-
Updated
Oct 14, 2025 - Shell
📊 Fetch free historical candlestick data from the Blofin Exchange API without API keys, saving years of data to CSV for analysis and research.
-
Updated
Nov 13, 2025 - Python
Ingest data fetched from Snowflake to ElasticSearch
-
Updated
Oct 27, 2020 - Python
This repository is dedicated to learning LangChain by creating a generative AI application. This web application uses Pinecone as a vector store to answer questions related to LangChain, utilizing sources from the official LangChain documentation.
-
Updated
Jul 30, 2024 - Python
Merpian Limited - Soverign AI research into personal intelligent assistive nodes
-
Updated
Oct 4, 2025
Go implementation for handling huge amounts of http uploads
-
Updated
Jul 25, 2019 - Go
Performing DataBase Ingester
-
Updated
Jun 21, 2022 - Java
Efetuar o download de arquivos da web com Python. Inserir dados de um dataframe na cloud Azure com Azure SQL Database. Efetuar transformações nos dados com Azure Data Factory.
-
Updated
Jul 25, 2023 - Jupyter Notebook
📈 Fetch free historical candlestick data from Bitget Futures API easily, without API keys, and save clean CSV files for analysis or research.
-
Updated
Nov 13, 2025 - Python
A framework that eliminates the dependency on Apache Spark by leveraging delta-rs for the creation and management of Delta Lake tables. This framework follows Medallion architecture.
-
Updated
Jun 19, 2025 - HTML
Improve this page
Add a description, image, and links to the ingestion topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the ingestion topic, visit your repo's landing page and select "manage topics."