Skip to content

refactor(rag-evaluator): shared shell, domain refs, and example agent#12

Draft
davidberenstein1957 wants to merge 3 commits into
feat/text2sql-evaluatorfrom
feat/rag-evaluator-alignment
Draft

refactor(rag-evaluator): shared shell, domain refs, and example agent#12
davidberenstein1957 wants to merge 3 commits into
feat/text2sql-evaluatorfrom
feat/rag-evaluator-alignment

Conversation

@davidberenstein1957

Copy link
Copy Markdown
Member

Summary

  • Slim rag-evaluator/SKILL.md to the shared evaluator shell pattern.
  • Add parallel domain refs and replace superseded rag-eval-dimensions / synthetic-qa-generation docs.
  • Add runnable example-agent/ with retrieval guardrails and pytest suite.

Depends on

  • text2sql-evaluator PR (stacked)

Test plan

  • cd oss/checks/rag-evaluator/example-agent && uv sync && uv run pytest eval/test_retrieval_guardrails.py -q
  • Verify shared shell links in SKILL.md

Made with Cursor

@davidberenstein1957 davidberenstein1957 force-pushed the feat/text2sql-evaluator branch from 859300d to 51b2355 Compare May 20, 2026 17:18
@davidberenstein1957 davidberenstein1957 force-pushed the feat/rag-evaluator-alignment branch from 5857189 to 5b962d3 Compare May 20, 2026 17:18
davidberenstein1957 and others added 3 commits May 20, 2026 19:18
Slim SKILL.md to the evaluator shell pattern, add parallel domain
references and a runnable example-agent, and remove superseded
rag-eval-dimensions and synthetic-qa-generation docs.

Co-authored-by: Cursor <cursoragent@cursor.com>
Reflect shared shell refactor, domain references, and example-agent.

Co-authored-by: Cursor <cursoragent@cursor.com>
Cross-link domain simulate-users docs to shared multi-turn guidance on
different UserSimulator instances per .interact() step.

Co-authored-by: Cursor <cursoragent@cursor.com>
@davidberenstein1957 davidberenstein1957 force-pushed the feat/rag-evaluator-alignment branch from 5b962d3 to 9527fc4 Compare May 20, 2026 17:18
@davidberenstein1957 davidberenstein1957 marked this pull request as draft May 21, 2026 06:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Development

Successfully merging this pull request may close these issues.

1 participant