An API for generating synthetic datasets using a Large Language Model (LLM).
-
Updated
Nov 8, 2024
An API for generating synthetic datasets using a Large Language Model (LLM).
Python package to generate texts using batch inference from LLM providers.
neural networks are cool but can they count?
Generate test data with Telegram bot in one click: random users, files, texts and credit cards.
Data generation for SQL queries - project from my master thesis
eCommerce Data as a Service - robot crawler that collect and aggregate public data from eCommerce websites, then parse it into a meaningful datasets that help eCommerce owners to take smarter businesses decisions.
A containerized implementation of the VAMBN approach by TA6.4.
Details the data modeling techniques used, the functionality of the output, and an in-depth idea of how a plan finder works based off of user inputs.
This project generates fake e-commerce order data on a daily basis. The data is saved as a CSV file and automatically uploaded to an S3 bucket using AWS Lambda and Amazon EventBridge Scheduler.
AI powered demo with vector embeddings similarities detection, 4D 20 R10 required
Функции Excel.
The source code used for paper "TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision", published in WWW 2025.
In this research project, we consider a molecular communication system that is made of a 3D unbounded diffusion channel model without flow, a point transmitter, and a spherical absorbing receiver. In particular, we study the impact of inter-symbol interference and analyze the performance of different threshold-based receiver schemes. This resear…
FsSpec represents value constraints as data to reuse one constraint declaration for validation, data generation, error explanation, and more.
A library for doing image augmentation
Code-Switched Data generation based on Part-of-speech and Language Modeling of the generated text.
Generate and evaluate synthetic tabular data using GANs with visual comparisons.
Augment robot training data with generative media
Add a description, image, and links to the data-generation topic page so that developers can more easily learn about it.
To associate your repository with the data-generation topic, visit your repo's landing page and select "manage topics."