Klipse is a JavaScript plugin for embedding interactive code snippets in tech blogs.
-
Updated
Oct 1, 2024 - HTML
Klipse is a JavaScript plugin for embedding interactive code snippets in tech blogs.
A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use cases, promote the adoption of best practices in LLM assessment, and critically assess the effectiveness of these evaluation methods.
EMNLP 2025 Oral benchmark for evaluating LLM knowledge of ESG and sustainability standards, with 1,136 source-grounded questions, evaluation code, and interactive 50-model results.
RewardAnything: Generalizable Principle-Following Reward Models
abed: A command line tool for easily running machine learning benchmarks
BenchClaw — Multi-dimensional AI agent evaluation with 17-judge AI Tribunal, 10 scoring dimensions, radar charts, and deception detection. Benchmark any LLM agent.
This is a test-ware for the evaluation of Protractor + Cucumber test automation tools.
Monitoring and evaluating LLM apps with Langfuse. Presented at PyConZA 2024.
Exploring Mouse-Contingent Reading Times.
Open-source toolkit ecosystem for development research, evaluation, and program design. Built in India, shared for impact.
Google ADK Evaluation Service(Docker + HTML)
Generate quiz and evaluations with markdown and AI
Public website for the ICDAR 2021 Competition on Historical Map Segmentation
FastEval Parkinsonism is a AI-based online solution for self-assessing parkinsonism in real time.
Official RanxHub repository
Projet d'évaluation, sur un jeu de paires (memory) dans le cadre de ma formation DWWM.
Content-Based Recommender - Adam Hącia 2022
The Evaluator app is a powerful and intuitive tool designed to streamline the evaluation process across professional performance and project assessments. With its user-friendly interface and customizable criteria, Evaluator simplifies the task of scoring and providing feedback.
It's a basic repo having code of a random HTML page, represent Masai WEB-101-Course evaluation.
Add a description, image, and links to the evaluation topic page so that developers can more easily learn about it.
To associate your repository with the evaluation topic, visit your repo's landing page and select "manage topics."