-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedSep 27, 2025 -
llvm-project Public
Forked from llvm/llvm-projectThe LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
LLVM Other UpdatedSep 29, 2024 -
torchchat Public
Forked from pytorch/torchchatRun PyTorch LLMs locally on servers, desktop and mobile
Python BSD 3-Clause "New" or "Revised" License UpdatedSep 26, 2024 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedSep 25, 2024 -
llm-analysis Public
Forked from cli99/llm-analysisLatency and Memory Analysis of Transformer Models for Training and Inference
Python Apache License 2.0 UpdatedMay 13, 2024 -
-