IDC (Image Dataset Creator) Tool for Windows
-
Updated
Sep 21, 2025 - TypeScript
IDC (Image Dataset Creator) Tool for Windows
A modern, visual editor for spintax (spinning syntax) with tree-based editing, live preview, and YAML export. Ideal light-weight AI training data generation alternative. Built with Next.js, TailwindCSS, and TypeScript.
A benchmark dataset of real-world code review comments, designed to evaluate automated code review software/agents.
Dynamically add columns to your (.csv) Dataset using the OpenAI API
The dataset builder script extracts Bitcoin's Lightnining Network statistics through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.
🧠 Transform reasoning tasks with OpenReason, a powerful engine that streamlines query handling across any LLM provider for clear, confident results.
The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.
A full-stack webapp for collecting and managing speech datasets.
Quickly and efficiently caption your image dataset for AI training
Proxy server that automatically stores messages exchanged between any OAI-compatible frontend and backend as a ShareGPT dataset to be used for training/finetuning.
The dataset builder script extracts the most relevant market data straight from Binance's API and builds a series of datasets that can be used in data science and machine learning projects.
An easy to use imageboard scraper.
DescribeML is a Visual Studio Code language plug-in to describe machine-learning datasets in a structured format. Build better data describing the composition, provenance and social concerns of your dataset.
Torque is a Declarative, typesafe DSL for building synthetic LLM datasets — compose conversations like React components
🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
Add a description, image, and links to the dataset-generation topic page so that developers can more easily learn about it.
To associate your repository with the dataset-generation topic, visit your repo's landing page and select "manage topics."