Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
scikit-learn: machine learning in Python
A book-in-progress about the Linux kernel and its insides.
CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.
A library that provides an embeddable, persistent key-value store for fast storage.
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
互联网公司技术架构,微信/淘宝/微博/腾讯/阿里/美团点评/百度/OpenAI/Google/Facebook/Amazon/eBay的架构,欢迎PR补充
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
Algorithm and data structure articles for https://cp-algorithms.com (based on http://e-maxx.ru)
An Emacs configuration bundle with batteries included
Pikiwidb is a Redis-Compatible database developed by Qihoo's infrastructure team.
LIBSVM -- A Library for Support Vector Machines
a curated list of awesome streaming frameworks, applications, etc
Automatically exported from code.google.com/p/smhasher
Do not send pull requests! Automated Git clone of various OpenJDK branches
Reference implementations of MLPerf® training benchmarks
DataStax Python Driver for Apache Cassandra
A scalable machine learning library on Apache Spark
[UNMAINTAINED] A developer-friendly Python library to interact with Apache HBase
Git mirror of the official (mercurial) repository of cpp-btree