Popular repositories
-
PowerInfer (Public; forked from SJTU-IPADS/PowerInfer)
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
C++
-
LLMLingua (Public; forked from microsoft/LLMLingua)
[EMNLP'23, ACL'24] Compresses the prompt and KV cache to speed up LLM inference and improve the model's perception of key information, achieving up to 20x compression with minimal performance loss.
Python
-
SparseLLM (Public; forked from BaiTheBest/SparseLLM)
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
Python
-
litgpt (Public; forked from Lightning-AI/litgpt)
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Python
-
LLaMA-Factory (Public; forked from hiyouga/LLaMA-Factory)
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Python
-
lo-fit (Public; forked from fc2869/lo-fit)
LoFiT: Localized Fine-tuning on LLM Representations
Python