Skip to content

gty111/gty111

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 

Repository files navigation

  • PH.D. student at Sun Yat-sen university

  • AI Infra, MLSys, Simulaters, GPU architecture

  • Visit my personal web

News

  • [2025/06/27] [arXiv] [Code] gLLM is accepted by SC'25. Congratulations!
  • [2025/05/28] [arXiv] [Code] EFIM is accepted by Euro-Par'25
  • [2025/04/27] [arXiv] [Code] We have released gLLM, an efficient pipeline parallelism inference engine for LLM.

PRs for Project

  • vLLM: [Bugfix] Fix benchmark_moe.py link
  • sglang: Fix port number overflow link
  • xDiT: Enable warm up for VAE link
  • xDiT: Fix parallel vae link
  • DistVAE: Fix batch dimension link
  • vLLM: [Benchmark] Refactor sample_requests in benchmark_throughput link
  • vLLM: [Bugfix] fix automatic prefix args and add log info link
  • vLLM: [Minor Fix] Fix comments in benchmark_serving link
  • vLLM: [Minor Fix] Remove unused code in benchmark_prefix_caching.py link
  • TVM: [Doc] Fix minor error in "Expressions in Relay" link
  • TVM: [Doc] Fix minor error in doc (Add an operator to Relay) link

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published