Popular repositories
- FlashOverlap Public
A lightweight design for computation-communication overlap.
Repositories
- Infini-Memory Public
Infini-Memory: A maintainable, text-based persistent memory architecture that organizes LLM agent memory as topic-structured documents. Paper: https://arxiv.org/abs/2606.10677
- FUSCO Public
High-performance distributed data shuffling (all-to-all) library for MoE training and inference.
- Semi-PD Public
A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.
- HamiltonAttention Public
- Infini-Megrez Public
- STAlloc Public
- llama.cpp Public