
Unify API-based inference for weave.py #7

Open
ksadov wants to merge 4 commits into JD-P:main from ksadov:unify_api

Conversation


ksadov commented Jun 22, 2024

Currently, weave.py contains separate functions for inference with OpenAI and vLLM. But both of these APIs, along with a number of others, follow the OpenAI v1 completions API, so supporting arbitrary backends is just a matter of making the base URL and API key configurable.
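To illustrate the idea, here is a minimal sketch of a single request helper parameterized by base URL and API key. The function and argument names are hypothetical and don't correspond to weave.py's actual helpers; it just shows that one code path can serve any OpenAI-v1-compatible endpoint.

```python
import json
import urllib.request


def build_request(prompt, model_name, api_key=None, **params):
    """Build headers and payload for an OpenAI-v1-style completions call.

    Hypothetical sketch; weave.py's real inference functions may differ.
    """
    headers = {"Content-Type": "application/json"}
    if api_key:
        # Hosted APIs (OpenAI, Together AI) expect a bearer token;
        # a local server (vLLM, llama.cpp) typically needs none.
        headers["Authorization"] = f"Bearer {api_key}"
    payload = {"model": model_name, "prompt": prompt, **params}
    return headers, payload


def completions_request(prompt, model_name, api_base, api_key=None, **params):
    """POST to any OpenAI-v1-compatible completions endpoint."""
    headers, payload = build_request(prompt, model_name, api_key, **params)
    req = urllib.request.Request(
        api_base,
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())
```

Swapping backends then only changes `api_base` and `api_key`, exactly as the new `--gen-api-base`/`--eval-api-base` flags do.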

I made some changes to weave.py inference functions and args to support this. Examples to try:

  • Business as usual: python weave.py --gen-model-name mistralai/Mistral-7B-v0.1 --eval-model-name jdpressman/minihf_evaluator_mistral_7b_v0.1
  • Generating with a local model, evaluating with an OpenAI model: python weave.py --gen-model-name openai-community/gpt2 --eval-model-name davinci-002 --eval-api-base https://api.openai.com/v1/completions --eval-api-key $OPENAI_API_KEY
    • you'll need to comment out repetition_penalty and top_k in the evaluate_outputs_api payload to get it to work
  • Generating with a model hosted on Together AI, evaluating with a model running on a local llama.cpp server: python weave.py --gen-model-name mistralai/Mistral-7B-v0.1 --gen-api-base https://api.together.xyz/v1/completions --gen-api-key $TOGETHERAI_API_KEY --eval-model-name Meta-Llama-3-8B-Q4_5_M --eval-api-base http://localhost:5000/v1/completions
    • you'll need to comment out seed in the generate_outputs_api payload to get it to work

It's not ideal that getting certain APIs to work requires commenting out code, but I'd rather handle that via config files than by tacking on more command-line args, and that seems like a job for a separate PR.
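The config-file approach above could replace the hand-commenting with a per-backend drop-list. A rough sketch (the table below only encodes the two incompatibilities mentioned in this PR, `repetition_penalty`/`top_k` for OpenAI and `seed` for llama.cpp; the names and structure are assumptions, not part of this PR's code):

```python
# Sampler parameters each backend is known to reject, from the examples above.
# Hypothetical structure; in practice this would live in a config file.
UNSUPPORTED_PARAMS = {
    "openai": {"repetition_penalty", "top_k"},
    "llama.cpp": {"seed"},
}


def filter_payload(payload, backend):
    """Drop payload keys the target backend rejects, instead of
    commenting them out by hand."""
    drop = UNSUPPORTED_PARAMS.get(backend, set())
    return {k: v for k, v in payload.items() if k not in drop}
```

With something like this, `evaluate_outputs_api` and `generate_outputs_api` could build one full payload and filter it per endpoint.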

ksadov mentioned this pull request Jun 24, 2024