Tags: Tracer-Cloud/opensre

v2026.6.12

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
feat(bench): investigation_a1 + translation_loss metrics (#2798)

* added L0 level for opensre score
* fixed namespace match issue

v2026.6.11

feat(bench): structured-outputs predictor + overfit controls (#2794)

* full experiment package (revert + new variant + overfit controls)
* added overfit into bench framework
* fix(bench): address greptile review on structured-outputs PR
* fixed A/A variant issue
* fixed float division
* the same experiment but for N=100

v2026.6.10

fix(bench): predictor DB-localization rule (Runtime gap) (#2788)

* added DB-localization rule
* added redis

v2026.6.9

fix(bench): experiments: false-healthy guard with plumbing and bridge contract tests, floor 0 tools; refactoring (#2770)

* fix(bench): false-healthy guard + plumbing + bridge contract tests
* added B2-fire stats column
* fix(bench): gate corpus-required false-healthy tests + B2 validation config
* revert false-healthy guard as it did not work
* config for openai comparison
* bench openai config for floor 0
* added config for experiment with floor 0 of tools
* fix(bench): add min_tool_calls config field + CLI override
* chore(bench): move configs/ into cloudopsbench/configs/
* move configs/ + cloudopsbench AWS docs into cloudopsbench/

v2026.6.8

fix(bench): cloudopsbench vocab + scope rule + fix-a + taxonomy fix (#2768)

* fix bench config
* improving bench predictor
* fix(bench): skip perf-localization tests when corpus is absent
* fix(bench): guard metric_alerts + obj counter symmetry

v2026.6.7

fix(bench): dispatcher singleton + vocab snap + object_a1 + llm_alone control arm (#2759)

* report consistency-selected A@1
* fixed dispatcher bug
* fixed object_a1, object_a3 not emitted in per-case metrics
* added floor sweep and pure LLM
* added MTTI metric
* fixed MODEL_CONTEXT_WINDOWS
* handled oversized prompt
* running experiment
* set default min tool calls to 5 (was 8)

v2026.6.6

feat(hermes): add surface attribution evaluation suite (#2692)

v2026.6.5

fix(bench): scope promote-image apply to task def, skip Deregister (#2752)

* fixed roles for tag in terraform

v2026.6.4

chore(deps): bump huggingface-hub from 1.15.0 to 1.17.0 (#2727)

Bumps [huggingface-hub](https://github.com/huggingface/huggingface_hub) from 1.15.0 to 1.17.0.
- [Release notes](https://github.com/huggingface/huggingface_hub/releases)
- [Commits](huggingface/huggingface_hub@v1.15.0...v1.17.0)

---
updated-dependencies:
- dependency-name: huggingface-hub
  dependency-version: 1.17.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

v2026.6.3

fix(bench): removed from ignoring infra/bench/entrypoint.sh (#2706)