Stars
The lightweight, user-friendly, fault-tolerant database built on SQLite.
A vector search SQLite extension that runs anywhere!
Access a database of word frequencies, in various natural languages.
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
💫 Industrial-strength Natural Language Processing (NLP) in Python
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
An extremely fast Python linter and code formatter, written in Rust.
An Elixir Authentication System for Plug-based Web Applications
100+ Chinese Word Vectors 上百种预训练中文词向量
Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
The Divergent Association Task is a brief measure of creativity
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
JavaScript player library / DASH & HLS client / MSE-EME player
Custom elements (web components) for making audio and video player controls that look great in your website or app.
Tesseract Open Source OCR Engine (main repository)
Open Source Continuous File Synchronization
"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files
Distributed reliable key-value store for the most critical data of a distributed system
The Prometheus monitoring system and time series database.
Xray, Penetrates Everything. Also the best v2ray-core. Where the magic happens. An open platform for various uses.
A tool for secrets management, encryption as a service, and privileged access management
Consul is a distributed, highly available, and data center aware solution to connect and configure applications across dynamic, distributed infrastructure.