More validations #48

farzadab · 2024-07-09T23:42:08Z

This PR separates validation data_sets from training data_sets and adds separate evaluations for the following datasets both in the audio and text-only modes: ["heysquad_human", "anyinstruct", "soda", "peoplespeech"]

Example perplexity/loss curves:

farzadab · 2024-07-17T21:58:13Z

For some reason I had assumed that I had merged this PR but it was just sitting there for a week!
I will add ST evals separately later on.

ultravox/data/datasets.py

ultravox/training/train.py

farzadab · 2024-07-22T18:26:50Z

PTAL @juberti. All comments were applied.

ultravox/training/train.py

* add heysquad and slue-sqa5 datasets * multi-ds evaluations * add spanish and chinese evals * remove chinese and spanish val sets due to hang * "Transcribe <|audio|>" to "Transcribe\n<|audio|>" * _get_messages helper function * moved contenxt len check to _get_query_prompt

- revert max_audio_duration_secs to the default 30s in eval_config_2k.yaml - update poetry.lock as the previous version is outdated

farzadab added 9 commits July 8, 2024 12:00

add heysquad and slue-sqa5 datasets

f74cafa

multi-ds evaluations

57716e4

minor fixes

ce49028

add spanish and chinese evals

5d5cf38

remove chinese and spanish val sets due to hang

a858dfd

is_impossible

4e5e24f

formatting

4eff805

Merge remote-tracking branch 'origin/main' into farzad-more-vals

e6f1585

mypy fixes

4a6ddfd

farzadab marked this pull request as ready for review July 17, 2024 21:54

farzadab requested a review from juberti July 17, 2024 21:57

farzadab requested a review from zqhuang211 July 17, 2024 21:58

juberti reviewed Jul 18, 2024

View reviewed changes

farzadab added 6 commits July 22, 2024 10:40

"Transcribe <|audio|>" to "Transcribe\n<|audio|>"

58afae5

_get_messages helper function

6844a73

mypy type fix

bffe66d

comment for matchtrain

7d58b84

moved contenxt len check to _get_query_prompt

ce7415b

fix test prompt change

18d4d3f

juberti approved these changes Jul 23, 2024

View reviewed changes

ultravox/training/train.py Outdated Show resolved Hide resolved

removing divide by 2 for val_num_samples

cd462a6

farzadab merged commit c3c8dd1 into main Jul 23, 2024

farzadab deleted the farzad-more-vals branch July 23, 2024 17:07

zqhuang211 added a commit that referenced this pull request Feb 12, 2025

Minor fix (#48)

1c3db70

- revert max_audio_duration_secs to the default 30s in eval_config_2k.yaml - update poetry.lock as the previous version is outdated

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

More validations #48

More validations #48

Uh oh!

farzadab commented Jul 9, 2024 •

edited

Loading

Uh oh!

farzadab commented Jul 17, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

farzadab commented Jul 22, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

More validations #48

More validations #48

Uh oh!

Conversation

farzadab commented Jul 9, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

farzadab commented Jul 17, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

farzadab commented Jul 22, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

farzadab commented Jul 9, 2024 •

edited

Loading

farzadab commented Jul 17, 2024 •

edited

Loading