Starred repositories
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Filament is a real-time physically based rendering engine for Android, iOS, Windows, Linux, macOS, and WebGL2
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
A tensorflow implementation of EAST text detector
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
CPU inference for the DeepSeek family of large language models in C++
Flutter App That Can Transcribe Audio Offline/On Device with Whisper C++ Bindings via Rust