Build and benchmark deep research
Updated Mar 28, 2026 - Python
Optical Flow Dataset and Benchmark for Visual Crowd Analysis
Machine Learning Benchmark Scripts
Generate performance reports from your Django database performance tests.
Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms
A Framework for Benchmarking Clustering Algorithms
Plug-and-Play Benchmarking of Reinforcement Learning Algorithms for Large-Scale Flow Control
A CLI for benchmarking Scrapy.
A comprehensive and efficient long-context model evaluation framework
Collection of Multi-Fidelity benchmark functions
Editor, bench tool and a daily notifier for supercharging Advent of Code!
BenchPush is a comprehensive benchmarking suite designed for mobile robots performing pushing-based tasks. It provides simulated environments, evaluation metrics, and baseline demonstrations, and is available as an open-source Python library.
SER Evals: In-Domain and Out-of-Domain Benchmarking for Speech Emotion Recognition
Hosts RDDL domain and instance files covering problems from a wide range of disciplines, with integration into the pyRDDLGym ecosystem.
[FSE 2023] Comparison and Evaluation on Static Application Security Testing (SAST) Tools for Java
Optimisation problem library
CoreBench (not to be confused with CORE-bench or CoREBench) is a CPU benchmarking system that aims to produce an unbiased, community-driven database of real-world CPU data. This is the repository for the benchmark suite for Linux.
TSPERF Time Series Database Benchmark Suite. Framework for evaluating and comparing the performance of time series databases, in the spirit of TimescaleDB's TSBS.