Python 9 4 Updated Jun 18, 2026
TypeScript 8 Updated Mar 23, 2026

A minimalist, open source online pastebin where the server has zero knowledge of pasted data. Data is encrypted/decrypted in the browser using 256-bit AES.

PHP 8,402 998 Updated Jun 21, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,446 710 Updated May 17, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 29,556 6,676 Updated Jun 23, 2026

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,605 273 Updated Jun 23, 2026

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 1,423 159 Updated Jun 23, 2026

A platform for community discussion. Free, open, simple.

Ruby 47,316 8,944 Updated Jun 23, 2026

The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm

Python 1,101 265 Updated Jun 23, 2026

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

8,008 288 Updated May 15, 2025

Analyze computation-communication overlap in V3/R1.

1,170 148 Updated Mar 21, 2025

Use your Neovim like using Cursor AI IDE!

Lua 17,989 828 Updated Jun 22, 2026

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 51,233 9,094 Updated Jun 23, 2026

A good-looking third-party NetEase Cloud Music player, supporting Windows / macOS / Linux :electron:

Vue 32,972 4,703 Updated Jun 14, 2026

A sparse attention kernel supporting mix sparse patterns

C++ 527 55 Updated Jan 18, 2026
C++ 37 7 Updated Jul 19, 2025

ASCII generator (image to text, image to image, video to video)

Python 8,271 650 Updated Nov 22, 2024

💤 A modern plugin manager for Neovim

Lua 21,160 578 Updated Jun 22, 2026

A file explorer tree for neovim written in lua

Lua 8,559 634 Updated Jun 22, 2026

Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)

199 13 Updated May 14, 2026

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)

Python 191 39 Updated Jul 10, 2024

Infinite Photorealistic Worlds using Procedural Generation

Python 7,022 599 Updated May 19, 2026

Efficient and easy multi-instance LLM serving

Python 556 50 Updated Mar 12, 2026

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,896 370 Updated Dec 17, 2025

Fast, Flexible and Portable Structured Generation

C++ 1,755 158 Updated Jun 22, 2026

Efficient Triton Kernels for LLM Training

Python 6,452 543 Updated Jun 17, 2026

[ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding

Python 151 12 Updated Dec 4, 2024

A programming framework for agentic AI

Python 59,182 8,923 Updated Apr 15, 2026

16-fold memory access reduction with nearly no loss

Python 107 9 Updated Mar 26, 2025

A throughput-oriented high-performance serving framework for LLMs

Jupyter Notebook 962 50 Updated Mar 29, 2026