Skip to content
View zhangzy-stone's full-sized avatar

Block or report zhangzy-stone

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,365 11,090 Updated Nov 7, 2025

Mirror of Apache Kafka

Java 31,240 14,773 Updated Nov 7, 2025

Ongoing research training transformer models at scale

Python 14,113 3,249 Updated Nov 7, 2025

Community maintained hardware plugin for vLLM on Ascend

Python 1,322 542 Updated Nov 7, 2025

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,235 28,922 Updated Nov 7, 2025

The Moby Project - a collaborative project for the container ecosystem to assemble container-based systems

Go 71,018 18,832 Updated Nov 7, 2025

Linux kernel source tree

C 206,477 58,283 Updated Nov 7, 2025

For developers, who are building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data structure server, and document and vector query engine.

C 71,586 24,306 Updated Nov 7, 2025

Apache Impala

C++ 1,250 539 Updated Nov 6, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,190 31,066 Updated Nov 6, 2025

Apache HBase

Java 5,453 3,380 Updated Nov 6, 2025

Apache Hadoop

Java 15,353 9,144 Updated Nov 6, 2025

Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)

C 1,491 491 Updated Nov 6, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,345 479 Updated Nov 6, 2025

Checkpoint/Restore tool

C 3,465 677 Updated Nov 5, 2025

谷歌新书Agent设计模式(agentic design patterns)中文版,持续更新。附:在线阅读、pdf和epub电子书下载。

HTML 552 51 Updated Nov 2, 2025

AI tookit over KubeEdge

Go 524 168 Updated Oct 29, 2025

mirror of MIT krb5 repository

C 580 411 Updated Oct 23, 2025

Open CAS Framework

C 183 83 Updated Oct 14, 2025

memcached development tree

C 14,036 3,314 Updated Oct 9, 2025

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 16,454 1,288 Updated Oct 6, 2025

MinIO Client SDK for Java

Java 1,250 508 Updated Sep 26, 2025

Advanced data structure and algorithm for system design,系统设计需要了解的算法

1,602 289 Updated Sep 23, 2025

Efficient and easy multi-instance LLM serving

Python 506 42 Updated Sep 3, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,382 1,367 Updated Jul 9, 2025

海棠诗社,古诗词的数字桃源

Astro 895 172 Updated Jul 2, 2025

Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.

384 22 Updated Mar 3, 2025

A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means

Java 2,103 232 Updated Feb 17, 2025

libco is a coroutine library which is widely used in wechat back-end service. It has been running on tens of thousands of machines since 2013.

C++ 8,633 2,133 Updated Mar 7, 2024
Next