@FastLM

FastLM

We develop fast, lightweight LMs for large-scale, distributed, parallel, and sparse scenarios.

Popular repositories

  1. tinyserve-vllm Public

    [ACM MM 2025 Oral] TinyServe: Query-Aware Page Allocation Optimization

    Python 10 2

  2. CXL-SpecKV Public

    [FPGA'26 Oral] CXL-SpecKV: A Disaggregated FPGA Speculative KV-Cache for Datacenter LLM Serving

    C++ 8 1

  3. CSV-Decode Public

    CSV-Decode: Certifiable Sub-Vocabulary Decoding for Efficient Large Language Model Inference

    Python 8

  4. SPI_VecDB Public

    Distributed Parallel Multi-Resolution Vector Search

    Go 8

  5. HSGM Public

    [ICPADS 2025 Oral, *SEM 2025 Oral] HSGM: Hierarchical Segment-Graph Memory for Scalable Long-Text Semantics

    Python 7

  6. FastCache Public

    Forked from NoakLiu/FastCache-xDiT

    FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation [Efficient ML Model]

    Python 6

Repositories

Showing 10 of 12 repositories
  • tinyserve-vllm Public

    [ACM MM 2025 Oral] TinyServe: Query-Aware Page Allocation Optimization

    Python 10 Apache-2.0 2 0 3 Updated Dec 8, 2025
  • HSGM Public

    [ICPADS 2025 Oral, *SEM 2025 Oral] HSGM: Hierarchical Segment-Graph Memory for Scalable Long-Text Semantics

    Python 7 MIT 0 0 0 Updated Nov 23, 2025
  • CXL-SpecKV Public

    [FPGA'26 Oral] CXL-SpecKV: A Disaggregated FPGA Speculative KV-Cache for Datacenter LLM Serving

    C++ 8 1 0 0 Updated Nov 23, 2025
  • SPI_VecDB Public

    Distributed Parallel Multi-Resolution Vector Search

    Go 8 Apache-2.0 0 0 0 Updated Nov 9, 2025
  • CogLoad Public

    Cognitive Load Traces

    Python 1 0 0 0 Updated Nov 3, 2025
  • NeuroSpec Public

    Grammar- and Resource-Aligned Certifiable Speculative Decoding

    Python 0 0 0 0 Updated Oct 31, 2025
  • CSV-Decode Public

    CSV-Decode: Certifiable Sub-Vocabulary Decoding for Efficient Large Language Model Inference

    Python 8 0 0 0 Updated Oct 30, 2025
  • PiKV Public Forked from NoakLiu/PiKV

    PiKV: KV Cache Management System for MoE [Efficient ML System]

    Python 4 7 0 0 Updated Oct 26, 2025
  • GraphSnapShot Public Forked from NoakLiu/GraphSnapShot

    GraphSnapShot: Caching Local Structure for Fast Graph Learning [Efficient ML System]

    Python 2 5 0 0 Updated Sep 22, 2025
  • FastCache Public Forked from NoakLiu/FastCache-xDiT

    FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation [Efficient ML Model]

    Python 6 Apache-2.0 311 0 0 Updated Sep 22, 2025

Top languages

Python, C++, Go