benchmark
Here are 5,571 public repositories matching this topic...
An automated scoring function to facilitate and standardize the evaluation of goal-directed generative models for de novo molecular design
-
Updated
Sep 23, 2025 - Python
LLM Benchmark Suite for Humanities Image Data
-
Updated
Sep 23, 2025 - Python
Professional Load Testing for Any LLM Services or Routers. 大模型推理服务压测工具,性能测试结果AI分析
-
Updated
Sep 23, 2025 - Python
A Python and MATLAB implementation of mathematical test functions for benchmarking optimization algorithms.
-
Updated
Sep 23, 2025 - C++
Paper: C3 Benchmark: A Bilingual Benchmark for Spoken Dialogue Models Exploring Challenges in Complex Conversations
-
Updated
Sep 23, 2025 - HTML
📄 Discover top coding agents through the June 2025 evaluation report, featuring key findings, examples, and complete analysis for informed decision-making.
-
Updated
Sep 23, 2025 - TypeScript
🤖 Explore vital resources for testing AI agents, including frameworks, tools, and best practices to enhance reliability and performance.
-
Updated
Sep 23, 2025
JavaScript package managers performance comparison between NPM, Yarn, Yarn PnP, PnPM, and Bun.
-
Updated
Sep 23, 2025 - JavaScript
SketchUp tools for 3D modeling, push/pull workflows, 4K textures, and extensions. Access the Extension Warehouse and tutorials for professional use. 🐙
-
Updated
Sep 23, 2025
Explore the SE-TESTING repository for comprehensive labs and activities in software testing. Collaborate and enhance your skills with practical examples. 🛠️🌟
-
Updated
Sep 23, 2025 - JavaScript
# Prime-Numbers Web AppThis web app helps users explore prime numbers through features like checking primality and generating sequences. With clear visuals and step-by-step explanations, it makes understanding prime numbers simple and engaging. 🐙✨
-
Updated
Sep 23, 2025 - HTML
Benchmarking the experience of using monorepo tools - Bazel, Gradle, Lage, Lerna, Nx, Pants, Rush, Turborepo
-
Updated
Sep 23, 2025 - TypeScript
Benchmark results repository service
-
Updated
Sep 23, 2025 - Java
Open-source framework for defining Page Language Models (PLMs) for intelligent app understanding and AI-assisted testing.
-
Updated
Sep 23, 2025 - Python
Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions
-
Updated
Sep 23, 2025
xVerify: Efficient Answer Verifier for Large Language Model Evaluations
-
Updated
Sep 23, 2025 - Python
This repository contains all my practices and lessons about Parallel Programming in .NET
-
Updated
Sep 23, 2025
A public repository for weather-forecast
-
Updated
Sep 23, 2025
Improve this page
Add a description, image, and links to the benchmark topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the benchmark topic, visit your repo's landing page and select "manage topics."