New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Handle audio files less than 10s for speaker diarization. #1412

Merged

csukuangfj merged 1 commit into k2-fsa:master from csukuangfj:fix-speaker-diarzation-short-audio

Oct 11, 2024

Collaborator

csukuangfj commented Oct 11, 2024

If the input audio file is less than 10 seconds long, there is only one chunk, and there is no need to compute embeddings or do clustering.

We can use the segmentation result from the speaker segmentation model directly.


          Handle audio files less than 10s for speaker diarization.

b200bd9

csukuangfj merged commit f1b311e into k2-fsa:master

50 of 200 checks passed

csukuangfj deleted the fix-speaker-diarzation-short-audio branch

October 11, 2024 02:27

csukuangfj restored the fix-speaker-diarzation-short-audio branch

October 11, 2024 02:29

csukuangfj deleted the fix-speaker-diarzation-short-audio branch

October 11, 2024 02:32

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet