- Beijing, China
- weixiao-huang.github.io
Stars
Horust is a supervisor / init system written in rust and designed to run inside containers.
Kimi K2 is the large language model series developed by Moonshot AI team
A command line tool for calculating crc64, like sha256sum and md5sum.
FlashMLA: Efficient Multi-head Latent Attention Kernels
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
DLRover: An Automatic Distributed Deep Learning System
OpenID Connect (OIDC) identity and OAuth 2.0 provider with pluggable connectors
This repository hosts the Multi-Cluster Service APIs. Providers can import packages in this repo to ensure their multi-cluster service controller implementations will be compatible with MCS data pl…
A distributed transaction framework, supports workflow, saga, tcc, xa, 2-phase message, outbox patterns, supports many languages.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
Ongoing research training transformer models at scale
Various distributed Torch benchmarks
The road to hack SysML and become an system expert
The web framework for content-driven websites. ⭐️ Star to support our work!
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)
The central registry of Bazel modules for the Bzlmod external dependency system.
An awesome & curated list of best LLMOps tools for developers
Kubernetes Virtualization API and runtime in order to define and manage virtual machines.
Kata Containers is an open source project and community working to build a standard implementation of lightweight Virtual Machines (VMs) that feel and perform like containers, but provide the workl…