Stars
There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable an…
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
real time face swap and one-click video deepfake with only a single image
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.
Stable Diffusion built-in to Blender
A lightweight 3D Morphable Face Model library in modern C++
リアルタイムボイスチェンジャー Realtime Voice Changer
A python wrapper for Speech Signal Processing Toolkit (SPTK).
🗡️ Pokedex demonstrates modern Android development with Hilt, Material Motion, Coroutines, Flow, Jetpack (Room, ViewModel) based on MVVM architecture.
Interspeech 2019 tutorial materials
📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
(AAAI' 20) A Python Toolbox for Machine Learning Model Combination
Audio decoding libraries for C/C++, each in a single source file.
😎 Awesome lists about all kinds of interesting topics