Starred repositories
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Tencent Cloud API 3.0 SDK for C++
Converts SRT subtitle file to SSML file with speech durations
C++ Library Manager for Windows, Linux, and MacOS
Samples for CUDA developers that demonstrate features in the CUDA Toolkit
3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK)
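A collection of live-streaming source resources 📺 💯 IPTV, M3U. Wash your hands often, wear a mask, and may everyone stay safe from every virus.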
C++ version based on the Marching Cubes library of the paper: http://jcgt.org/published/0008/03/01
A public domain/MIT header-only marching cube implementation in C++ without anything fancy.
A Qt Widget encapsulated CEF view based on QWidget
Share interesting, entry-level open source projects on GitHub.
A cross-platform bilibili toolbox that supports downloading videos, anime series, and many other kinds of resources.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
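Offline voice input in Simplified/Traditional Chinese, Chinese-to-English translation, and subtitle transcription; online many-to-many translation, cloud clipboard, and more (uses the SenseVoice model; supports Mandarin, Cantonese, English, Japanese, and Korean)
The offline version of CapsWriter, a handy voice input tool for PC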
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
A debugging and profiling tool that can trace and visualize Python code execution
KDAB's collection of miscellaneous useful C++ classes and stuff
The swiss army knife of lossless video/audio editing