Stars
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Tesseract Open Source OCR Engine (main repository)
A library for efficient similarity search and clustering of dense vectors.
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
An unidentifiable mechanism that helps you bypass GFW.
A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.
brpc is an industrial-grade RPC framework written in C++, often used in high-performance systems such as search, storage, machine learning, advertisement, and recommendation. "brpc" mea…
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
FlashMLA: Efficient Multi-head Latent Attention Kernels
Unsupervised text tokenizer for Neural Network-based text generation.
COLMAP - Structure-from-Motion and Multi-View Stereo
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
A feature-rich dictionary lookup program, supporting multiple dictionary formats (StarDict/Babylon/Lingvo/Dictd) and online dictionaries, featuring perfect article rendering with the complete marku…
Fast inference engine for Transformer models
MITIE: library and tools for information extraction
General purpose unsupervised sentence representations
AutoPhrase: Automated Phrase Mining from Massive Text Corpora
A very simple, fast, multithreaded, platform-independent WebSocket (WS) and WebSocket Secure (WSS) server and client library implemented using C++11, Boost.Asio and OpenSSL. Created to be an easy w…
A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
speech-aligner is a tool that generates phoneme-level time-alignment annotations between human speech and its transcription.
Fast and customizable text tokenization library with BPE and SentencePiece support