Stars
3 results for forked starred repositories
📖 A curated list of Awesome LLM Inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
Updated Apr 28, 2024
Dominic789654 / LMFlow
Forked from OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Language Model for All.
shizhediao / openai-cookbook
Forked from openai/openai-cookbook
Examples and guides for using the OpenAI API