WebAssembly exmaple for speaker diarization #1411

csukuangfj · 2024-10-10T14:05:17Z

We have provided two spaces for you to try speaker diarization inside your browser with WebAssembly.

You can download test wave files from
https://github.com/k2-fsa/sherpa-onnx/releases/tag/speaker-segmentation-models

For instance, https://github.com/k2-fsa/sherpa-onnx/releases/download/speaker-segmentation-models/0-four-speakers-zh.wav

Caution: It uses a single thread with WebAssemly and it is a bit slow. The main point of the demo is to show that we support running speaker diarization with WebAssembly.

Note that the speaker segmentation model is from pyannote-audio. However, it does not depend on pyannote-audio.

As you know, pyannote-audio supports only Python; but our implementation is based on C++.

Huggingface space	ModelScope space
URL	URL

csukuangfj added 2 commits October 10, 2024 21:06

WebAssembly example for speaker diarization

246f257

publish to huggingface and modelscope

1c46a56

csukuangfj merged commit 1d061df into k2-fsa:master Oct 10, 2024
147 of 199 checks passed

csukuangfj deleted the wasm-speaker-diarization branch October 10, 2024 14:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WebAssembly exmaple for speaker diarization #1411

WebAssembly exmaple for speaker diarization #1411

csukuangfj commented Oct 10, 2024

WebAssembly exmaple for speaker diarization #1411

WebAssembly exmaple for speaker diarization #1411

Conversation

csukuangfj commented Oct 10, 2024