🤖 Generate unique synthetic datasets effortlessly with FauxFoundry, using local LLMs and YAML specifications for efficient, schema-aware data creation.
Updated Nov 10, 2025 - Go
🌐 Generate diagrams effortlessly with Mermaid AI Diagram Generator, transforming your ideas into visual representations quickly and clearly.
An advanced Industrial IoT (IIoT) simulator for Smart Factory 4.0 environments using Python, MQTT, and Docker. Emulates configurable production lines with realistic sensor data (vibration, temperature, quality) and predictive alerts.
Curated collection of multimodal data synthesis methods, covering papers, datasets, and best practices for vision-language model training
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation
Synthetic data generation for tabular data
Synthetic healthcare data generator for EHR testing and interoperability. Generates realistic, terminology‑backed cohorts with multi‑format export (FHIR R4, HL7 v2.x, VistA MUMPS, CSV/Parquet).
Generate arbitrary queries matching your GraphQL schema, and use them to verify your backend implementation.
🧠 Model-driven synthetic test data for CI/CD and analytics - deterministic, privacy-preserving, and domain-aware. Includes Python APIs, XML pipelines, and MCP/IDE integration to orchestrate realistic datasets for finance, healthcare, and other regulated environments.
This is a collection of TDK demo projects that use different databases and options
Model Context Protocol (MCP) server for generating fake/mock data using Faker.js
[ICCV 2025 Highlights] Large-scale photo-realistic virtual worlds for embodied AI
A novel approach for synthesizing tabular data using pretrained large language models
A library to model multivariate data using copulas.
🚀 AI-powered synthetic data generator that creates educational flowcharts and diagrams using LangGraph workflows. Features FastAPI integration, OpenAI LLM processing, and automated Mermaid diagram generation with iterative quality improvement through reflection patterns.
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.
Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.