-
mergekit Public
Forked from arcee-ai/mergekit
Tools for merging pretrained large language models.
Python Other Updated Feb 26, 2025 -
Awesome-LLM-Inference Public
Forked from xlite-dev/Awesome-LLM-Inference
📖 A curated list of Awesome LLM Inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
GNU General Public License v3.0 Updated Oct 15, 2024 -
EvolKit Public
Forked from arcee-ai/EvolKit
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Models (LLMs).
Jupyter Notebook MIT License Updated Sep 10, 2024 -
Liger-Kernel Public
Forked from linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
Python BSD 2-Clause "Simplified" License Updated Sep 10, 2024 -
MambaInLlama Public
Forked from jxiw/MambaInLlama
Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models
Python Apache License 2.0 Updated Aug 28, 2024 -
phi-mamba Public
Forked from goombalab/phi-mamba
Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models)
Python Updated Aug 20, 2024 -
DSKD Public
Forked from songmzhang/DSKD
Repo for Paper "Dual-Space Knowledge Distillation for Large Language Models".
Python Updated Aug 13, 2024 -
text-generation-inference Public
Forked from huggingface/text-generation-inference
Large Language Model Text Generation Inference
Python Apache License 2.0 Updated Jul 6, 2023 -
llm-foundry Public
Forked from mosaicml/llm-foundry
LLM training code for MosaicML foundation models
Python Apache License 2.0 Updated Jul 6, 2023 -
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Jul 6, 2023 -
gptq Public
Forked from IST-DASLab/gptq
Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers"
Python Updated Feb 20, 2023 -
snorkel Public
Forked from snorkel-team/snorkel
A training data creation and management system focused on information extraction
Python Apache License 2.0 Updated May 25, 2017 -
rpmalloc Public
Forked from mjansson/rpmalloc
Public domain cross platform lock free thread caching 16-byte aligned memory allocator implemented in C
Python The Unlicense Updated Mar 19, 2017 -
parallel-xxhash Public
Forked from jsnell/parallel-xxhash
Compute xxHash hash codes for 8 keys in parallel
C++ Updated Mar 19, 2017 -
deepvoice Public
Forked from israelg99/deepvoice
Deep Voice: Real-time Neural Text-to-Speech
Python Apache License 2.0 Updated Mar 15, 2017 -
junction Public
Forked from preshing/junction
Concurrent data structures in C++
C++ Other Updated Jan 18, 2017 -
aws-s3-class-loader Public
Forked from RGBz/aws-s3-class-loader
A Java ClassLoader implementation that yanks classes directly from an Amazon Web Services S3 bucket.
Java Updated Dec 5, 2013