Add support for other models in AutoEval by Divij97 · Pull Request #59 · hegelai/prompttools

Divij97 · 2023-08-04T19:39:22Z

This PR targets this feature request

steventkrawczyk · 2023-08-04T19:55:47Z

Hey @Divij97, this looks great! Very elegant way to support Anthropic + OpenAI as evaluators. I'm guessing Claude and GPT will need different eval prompts, but this is definitely headed in the right direction. Let me know when this is ready for a full review.
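For readers following along, the idea of one evaluator per model family, each with its own eval prompt, might look roughly like the sketch below. This is not the code in this PR: `ModelEvaluator`, `GptStyleEvaluator`, `ClaudeStyleEvaluator`, `pick_evaluator`, and the prompt wording are all hypothetical names made up for illustration, and no real API is called.

```python
from abc import ABC, abstractmethod
from typing import List


class ModelEvaluator(ABC):
    """Hypothetical base class: one subclass per family of evaluator models."""

    @abstractmethod
    def supports(self, model: str) -> bool:
        """Return True if this evaluator knows how to prompt `model`."""

    @abstractmethod
    def build_prompt(self, prompt: str, response: str) -> str:
        """Build the evaluation prompt sent to the evaluator model."""


class GptStyleEvaluator(ModelEvaluator):
    def supports(self, model: str) -> bool:
        return model.startswith("gpt")

    def build_prompt(self, prompt: str, response: str) -> str:
        # Illustrative wording only.
        return (
            "Rate the RESPONSE as RIGHT or WRONG.\n"
            f"PROMPT: {prompt}\nRESPONSE: {response}\nANSWER:"
        )


class ClaudeStyleEvaluator(ModelEvaluator):
    def supports(self, model: str) -> bool:
        return model.startswith("claude")

    def build_prompt(self, prompt: str, response: str) -> str:
        # Assumes a Human/Assistant turn format for the eval prompt.
        return (
            "\n\nHuman: Is the response below RIGHT or WRONG?\n"
            f"Prompt: {prompt}\nResponse: {response}"
            "\n\nAssistant:"
        )


EVALUATORS: List[ModelEvaluator] = [GptStyleEvaluator(), ClaudeStyleEvaluator()]


def pick_evaluator(model: str) -> ModelEvaluator:
    """Return the first registered evaluator that supports `model`."""
    for evaluator in EVALUATORS:
        if evaluator.supports(model):
            return evaluator
    raise ValueError(f"No evaluator registered for model: {model}")
```

Supporting another model family would then mean appending one more subclass to `EVALUATORS`, which keeps the model-specific prompt differences in one place.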

NivekT

Hi,

Thanks for opening this PR!

We refactored how experiment.evaluate() works. The TL;DR is that .evaluate() applies the evaluation function to one row of results at a time (plus any keyword args for the evaluation function).

At a glance, I don't think it should impact this PR, but please rebase and let me know if there are any issues.
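The row-at-a-time behavior described above can be sketched in plain Python. This is a simplified illustration, not the prompttools implementation: `evaluate_rows`, `contains_expected`, and the row layout (a dict with `prompt` and `response` keys) are assumptions made for the example.

```python
from typing import Any, Callable, Dict, List

Row = Dict[str, Any]


def evaluate_rows(
    rows: List[Row], eval_fn: Callable[..., float], **eval_kwargs: Any
) -> List[float]:
    """Call eval_fn once per row, forwarding any extra keyword args."""
    return [eval_fn(row, **eval_kwargs) for row in rows]


def contains_expected(row: Row, expected: str) -> float:
    """Toy evaluation function: 1.0 if the response mentions `expected`."""
    return 1.0 if expected.lower() in row["response"].lower() else 0.0


rows = [
    {"prompt": "Capital of France?", "response": "Paris"},
    {"prompt": "Capital of Japan?", "response": "Kyoto"},
]

# `expected` is passed through to the evaluation function as a keyword arg.
scores = evaluate_rows(rows, contains_expected, expected="Paris")
```

Under this shape, an evaluation function only ever sees a single row plus its own keyword arguments, so a model-based evaluator would read the fields it needs from the row it is given.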

NivekT

Hi @Divij97, I have run the CI and there are some import errors. Will you be able to rebase and have a look?

After that we should be able to merge quickly. Thanks!

Divij97 changed the title ~~Add framework for adding new model evaluators~~ Add support for other models in AutoEval Aug 4, 2023

Divij97 mentioned this pull request Aug 4, 2023

Add support for other models in AutoEval #44

Open

4 tasks

steventkrawczyk requested review from NivekT and steventkrawczyk August 4, 2023 19:51

steventkrawczyk mentioned this pull request Aug 5, 2023

Refactor Experiment #60

Merged

NivekT reviewed Aug 6, 2023

add framework for adding new model evaluators

30c03a6

Divij97 force-pushed the support-additional-models branch from 862ca6e to 30c03a6 Compare August 8, 2023 17:50

remove merge conflict and make code cleaner

447fc83

NivekT reviewed Aug 14, 2023

Divij97 wants to merge 2 commits into hegelai:main from Divij97:support-additional-models

3 participants
