Skip to content
@infinigence

Infinigence

Popular repositories Loading

  1. Infini-Megrez Infini-Megrez Public

    339 20

  2. Infini-Megrez-Omni Infini-Megrez-Omni Public

    Python 241 9

  3. FlashOverlap FlashOverlap Public

    A lightweight design for computation-communication overlap.

    Cuda 198 9

  4. Semi-PD Semi-PD Public

    A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.

    Python 119 15

  5. LVEval LVEval Public

    Repository of LV-Eval Benchmark

    Python 73 10

  6. SpecEE SpecEE Public

    Repo for SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting (ISCA25)

    C++ 69 9

Repositories

Showing 10 of 17 repositories

Top languages

Loading…

Most used topics

Loading…