Skip to content
View UMXihao's full-sized avatar

Block or report UMXihao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. PowerInfer PowerInfer Public

    Forked from SJTU-IPADS/PowerInfer

    High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

    C++

  2. LLMLingua LLMLingua Public

    Forked from microsoft/LLMLingua

    [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

    Python

  3. SparseLLM SparseLLM Public

    Forked from BaiTheBest/SparseLLM

    Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)

    Python

  4. litgpt litgpt Public

    Forked from Lightning-AI/litgpt

    20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

    Python

  5. LLaMA-Factory LLaMA-Factory Public

    Forked from hiyouga/LLaMA-Factory

    Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

    Python

  6. lo-fit lo-fit Public

    Forked from fc2869/lo-fit

    LoFiT: Localized Fine-tuning on LLM Representations

    Python