Open-source framework for defining Page Language Models (PLMs) for intelligent app understanding and AI-assisted testing.
-
Updated
Dec 14, 2025 - Python
Open-source framework for defining Page Language Models (PLMs) for intelligent app understanding and AI-assisted testing.
Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions
This repository contains all my practices and lessons about Parallel Programming in .NET
A public repository for weather-forecast
GTA (Guess The Algorithm) Benchmark - A tool for testing AI reasoning capabilities
🚀 Achieve rapid training of NanoGPT (GPT-2 124M) on a single RTX 4090, targeting a validation loss below 3.28 with FineWeb-Edu data.
🔬 Benchmark Spatial Transcriptomics data analysis with optimized pipelines using Seurat and Giotto for reproducible results and biological insights.
🔍 Evaluate synthetic data quality against real tabular datasets with Autocurator, measuring fidelity, coverage, privacy, and utility through clear metrics and visual reports.
🔍 Measure data authenticity and quality in synthetic analytics for safer AI. Explore relationships, diversity, and truthfulness in modern machine learning.
📈 Analyze and optimize portfolios with this interactive app, featuring modern techniques and tools for effective quantitative portfolio management.
🌐 Explore Graph Neural Networks through this comprehensive course, covering essential architectures and cutting-edge research in deep learning.
🕒 Simulate file timestamps in Linux/Unix applications by hijacking access functions, enabling seamless time manipulation without altering the filesystem.
🤖 Explore curated resources for Human Activity Recognition (HAR), including datasets for action recognition, motion capture, and pose estimation.
🚀 Benchmark GPU and CPU performance accurately across diverse hardware using PyTorch and TensorFlow, generating metrics and dashboards for optimization.
🧪 Streamline testing for Hono applications with a versatile toolkit for Cloudflare Workers, Node.js, and Bun, featuring smart data generation and HTTP utilities.
Add a description, image, and links to the benchmark topic page so that developers can more easily learn about it.
To associate your repository with the benchmark topic, visit your repo's landing page and select "manage topics."