5 releases (3 breaking)
| 0.4.0 | Apr 18, 2026 |
|---|---|
| 0.3.0 | Apr 12, 2026 |
| 0.2.2 | Apr 7, 2026 |
| 0.2.0 | Apr 6, 2026 |
| 0.1.2 | Mar 29, 2026 |
#86 in #language-framework
640KB
13K
SLoC
VIL Evaluation Framework (H10).
Provides metrics, datasets, batch evaluation, and reporting for LLM output quality.
use vil_eval::{EvalDataset, EvalCase, EvalRunner, AnswerRelevance};
let mut dataset = EvalDataset::new();
dataset.add_case(EvalCase {
question: "What is Rust?".to_string(),
context: "Rust is a systems programming language.".to_string(),
answer: "Rust is a systems programming language focused on safety.".to_string(),
reference: None,
});
let runner = EvalRunner::new(dataset).add_metric(Box::new(AnswerRelevance));
let report = runner.run();
assert_eq!(report.case_count(), 1);
vil_eval
VIL Evaluation Framework — metrics, dataset, batch evaluation, reporting (H10)
Part of VIL
This crate is part of VIL — a process-oriented language and framework for building zero-copy, high-performance distributed systems.
License
Licensed under either of Apache License 2.0 or MIT License.
Dependencies
~29–49MB
~700K SLoC