secsilm's Stars

Audio/Video

9 repositories

Python bindings for FFmpeg - with complex filtering support

Python · 10,891 stars · 933 forks · Updated Aug 4, 2024

A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.

C++ · 16,460 stars · 1,448 forks · Updated Nov 23, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python · 43,943 stars · 5,857 forks · Updated Aug 16, 2024

Inference and training library for high-quality TTS models.

Python · 5,498 stars · 582 forks · Updated Dec 10, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python · 2,051 stars · 260 forks · Updated Dec 15, 2025

Slightly improved version of the official code for fine-tuning XTTS

Python · 378 stars · 122 forks · Updated Apr 3, 2025

Web UI for using and fine-tuning XTTS

Python · 861 stars · 167 forks · Updated Jan 17, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python · 19,257 stars · 2,049 forks · Updated Oct 21, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook · 8,862 stars · 985 forks · Updated Dec 13, 2025