Popular repositories Loading
-
qed-bench
qed-bench PublicReproducible benchmarks comparing small scoring models against LLM-as-judge on essay quality, spam, AI-text detection, and LLM authorship — measuring quality, cost, and latency on the same Pareto f…
Jupyter Notebook
-
Repositories
Showing 3 of 3 repositories
- docs Public
u22a8/docs’s past year of commit activity - qed-bench Public
Reproducible benchmarks comparing small scoring models against LLM-as-judge on essay quality, spam, AI-text detection, and LLM authorship — measuring quality, cost, and latency on the same Pareto frontier.
u22a8/qed-bench’s past year of commit activity - plugins Public
Plugins (Claude Code, GitHub Actions) for scoring, evaluating, and improving content with DLMs (U/=22A8)
u22a8/plugins’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…