LeleCheung
🎯
Focusing
  • Institute of Computing Technology, Chinese Academy of Sciences
  • Beijing
  • 17:11 (UTC +08:00)

Highlights

  • Pro

9 stars written in Python

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · 62,256 stars · 11,065 forks · Updated Nov 6, 2025

Fast and memory-efficient exact attention

Python · 20,363 stars · 2,115 forks · Updated Nov 5, 2025

⭐Github Ranking⭐ Github stars and forks ranking list. Github Top100 stars list of different languages. Automatically updated daily. | GitHub repository ranking, automatically updated daily

Python · 9,167 stars · 554 forks · Updated Nov 6, 2025

Trading module

Python · 7,435 stars · 1,691 forks · Updated Sep 10, 2025

A flexible framework powered by ComfyUI for generating personalized Nobel Prize images.

Python · 1,517 stars · 101 forks · Updated Nov 4, 2024

NeuroCuts is a deep RL algorithm for generating optimized packet classification trees.

Python · 75 stars · 25 forks · Updated Jun 4, 2020

Artifacts for our ASPLOS'23 paper ElasticFlow

Python · 55 stars · 6 forks · Updated May 10, 2024

Discussion materials

Python · 9 stars · 1 fork · Updated Apr 24, 2024