WebAssembly example for speaker diarization #1411

Merged
merged 2 commits into from
Oct 10, 2024

csukuangfj
Collaborator

We have provided two spaces for you to try speaker diarization inside your browser with WebAssembly.

You can download test wave files from
https://github.com/k2-fsa/sherpa-onnx/releases/tag/speaker-segmentation-models

For instance, https://github.com/k2-fsa/sherpa-onnx/releases/download/speaker-segmentation-models/0-four-speakers-zh.wav
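If you prefer to fetch a test file from a script instead of the browser, the following is a minimal sketch. The helper names `wave_url` and `describe_wave` are made up for illustration and are not part of sherpa-onnx; the sketch also assumes the file is a plain PCM wave file that Python's standard `wave` module can read.

```python
import urllib.request
import wave

# Base URL and release tag are taken from the links above.
RELEASE_BASE = "https://github.com/k2-fsa/sherpa-onnx/releases/download"
RELEASE_TAG = "speaker-segmentation-models"


def wave_url(filename: str) -> str:
    """Build the download URL of a file attached to the release (hypothetical helper)."""
    return f"{RELEASE_BASE}/{RELEASE_TAG}/{filename}"


def describe_wave(path: str) -> dict:
    """Return basic properties of a local PCM wave file (hypothetical helper)."""
    with wave.open(path, "rb") as f:
        return {
            "sample_rate": f.getframerate(),
            "channels": f.getnchannels(),
            "duration_seconds": f.getnframes() / f.getframerate(),
        }


if __name__ == "__main__":
    name = "0-four-speakers-zh.wav"
    # Downloading requires network access.
    urllib.request.urlretrieve(wave_url(name), name)
    print(describe_wave(name))
```

The downloaded file can then be uploaded to either space.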

Caution: The demo runs WebAssembly in a single thread, so it is a bit slow. Its main purpose is to show that we support running speaker diarization with WebAssembly.

Note that the speaker segmentation model is from pyannote-audio. However, running it here does not depend on pyannote-audio.

pyannote-audio supports only Python, whereas our implementation is written in C++.

| Huggingface space | ModelScope space |
|---|---|
| URL | URL |
| Screenshot 2024-10-10 at 21 54 36 | Screenshot 2024-10-10 at 21 38 49 |

@csukuangfj csukuangfj merged commit 1d061df into k2-fsa:master Oct 10, 2024
147 of 199 checks passed
@csukuangfj csukuangfj deleted the wasm-speaker-diarization branch October 10, 2024 14:14