- Taiwan, Taipei
Lists (32)
Sort Name ascending (A-Z)
Algorithms and DataStructures
Audio_Sound_Engines
Auto-DL
BC-Learning
Certificates
Conferences
Conferences_Challenges
Contranstive-Learning
Data_Augmentation
Data_Labeling_Annotation
Dimension_Reduction
DL_Architectures_Models
DL_Training_Methods_Tricks
Emulatora and Simulators
Generative_Models_Audio_Speech
Model_Compression
Model_Compression_Pruning_PTQ_QAT_KD
Model_Conversion
Nerual_Architectures_Search
NN_Frameworks
NN_Libraries_Tools
Papers
Python Libraries
Pytorch Programming
Rust
SED_ESC
Self-Supervised-Learning
Sound_Audio_Analysis_Exploration
Speech_Sound_Related_Libraries
Tiny_NN_Architectures_Models
TinyML
TTS
- All languages
- Batchfile
- C
- C#
- C++
- CMake
- CSS
- CoffeeScript
- Cuda
- Dart
- Dockerfile
- Eagle
- Elm
- Erlang
- GCC Machine Description
- Go
- HCL
- HTML
- Handlebars
- Haskell
- Java
- JavaScript
- Julia
- Jupyter Notebook
- Kotlin
- Lua
- MATLAB
- MDX
- Makefile
- Markdown
- Nim
- OCaml
- Objective-C
- PHP
- Pascal
- Perl
- PowerShell
- Python
- QML
- R
- Roff
- Ruby
- Rust
- Scala
- Shell
- Swift
- TSQL
- TeX
- TypeScript
- VHDL
- Vala
- Verilog
- Vue
- WebAssembly
Starred repositories
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Lemon AI is the first Full-stack Open-source Self-Evolving General AI Agent, offering a fully local alternative to Agentic platforms like Manus & Genspark AI.🔔 Official updates X(twitter) @LemonAI_cc
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Automatic generation of presentation for an academic paper.
GPT-SoVITS ONNX Inference Engine & Model Converter
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
A collection of various Markdown files for testing purposes.
A simple file browser distributed as a custom element
A filemanager template built using JavaScript and SASS without any external dependencies or frameworks.
Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large language model
A natural language interface for computers
An open-source alternative to NotebookLM based on Python
A fully open-source, LlamaCloud-backed alternative to NotebookLM
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Wyoming protocol server for Piper text to speech system
Faster whisper Running on AMD GPUs with modified CTranslate 2 Libraries served up with Wyoming protocol
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…
🖥️ 🍭 Printing Pretty Tables on your console
Using Low-rank adaptation to quickly fine-tune diffusion models.
A research about how to fuse logit, prosodic and text data using titans