-
Shanghai Jiao Tong University
- Ann Arbor, MI
- https://risc-lt.github.io/
- @letianruan
Highlights
- Pro
Stars
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
GoogleTest - Google Testing and Mocking Framework
📚 C/C++ 技术面试基础知识总结,包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, in…
Hyprland is an independent, highly customizable, dynamic tiling Wayland compositor that doesn't sacrifice on its looks.
A library that provides an embeddable, persistent key-value store for fast storage.
Productive, portable, and performant GPU programming in Python.
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
FlashMLA: Efficient Multi-head Latent Attention Kernels
ZeroMQ core engine in C++, implements ZMTP/3.1
Highly customizable Wayland bar for Sway and Wlroots based compositors. ✌️ 🎉
The Fastest Distributed Database for Transactional, Analytical, and AI Workloads.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
CUDA Templates and Python DSLs for High-Performance Linear Algebra
High-speed Large Language Model Serving for Local Deployment
Transformer related optimization, including BERT, GPT
Pikiwidb is a Redis-Compatible database developed by Qihoo's infrastructure team.
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Optimized primitives for collective multi-GPU communication
Apache Kvrocks is a distributed key value NoSQL database that uses RocksDB as storage engine and is compatible with Redis protocol.
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
Tutorial code on how to build your own Deep Learning System in 2k Lines