A program that simulates answers given by a crowd to multiple choice questions with either a single or multiple answers correct, and writes it to a CSV
-
Updated
Jun 21, 2022 - Python
A program that simulates answers given by a crowd to multiple choice questions with either a single or multiple answers correct, and writes it to a CSV
Software to simulate compendium-wide gene expression data using a VAE.
Code for reproducing my thesis results.
Cryptocurrency reddit sentiment analysis application.
A sample database with a random data model and automatic reporting. (PL doc)
Rocket Flight Simulation project
Создание синтетического датасета на основе cимуляции свойств физики SEM
Code for JDST 2023 paper: "Simulating Realistic Continuous Glucose Monitor Time Series By Data Augmentation" by L.Gomez, A.Toye, R.Hum and S.Kleinberg
This repository contains projects and exercises I completed during my "Big Data Architecture" course. It reflects the concepts I’ve learned about data processing using Apache Spark and PySpark.
Agent4Edu: Generating Learner Response Data by LLM-based Agents for Intelligent Education Systems (AAAI 2025)
High-performance, multi-stream data ingestion simulator Built for testing real-time pipelines, PB-scale throughput, and stream processing systems like Kafka, Flink, FastAPI, and Iceberg.
🧠 Model-driven synthetic test data for CI/CD and analytics - deterministic, privacy-preserving, and domain-aware. Includes Python APIs, XML pipelines, and MCP/IDE integration to orchestrate realistic datasets for finance, healthcare, and other regulated environments.
[ECCV 2024] Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging
Симулятор рыночного мониторинга с автоматическим сбором данных и SQL-аналитикой (MAX, MIN, AVG). 📈🗄
Foundational tools for BCG X's data science packages.
An application for randomly generating telecommunication payment data.
This project simulates user behavior on a SaaS learning platform and analyzes product growth metrics using Python. The analysis focuses on understanding how users move through the product funnel, identifying drop-off points, and evaluating experiments that aim to improve conversion to paid users. The project also includes an A/B testing simulation
Schema-aware synthetic data for databases, APIs, and pipelines. Realistic, relational, privacy-safe.
Add a description, image, and links to the data-simulation topic page so that developers can more easily learn about it.
To associate your repository with the data-simulation topic, visit your repo's landing page and select "manage topics."