- NTU Singapore
- Singapore (UTC +08:00)
- https://chenwenyan.github.io
- https://orcid.org/0000-0001-8949-0816
Stars
AcadHomepage: A Modern and Responsive Academic Personal Homepage
An interference-aware scheduler for fine-grained GPU sharing
Artifact of OSDI '24 paper, "Llumnix: Dynamic Scheduling for Large Language Model Serving"
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
[TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets.
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LM. Ongoing research training transformer language models at scale, including: BERT & GPT-2
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" (a minimal LoRA sketch appears after this list).
A Quasi-Wasserstein Loss for Learning Graph Neural Networks (QW loss)
FedScale is a scalable and extensible open-source federated learning (FL) platform.
A curated reading list of research in Mixture-of-Experts (MoE); a toy top-k routing sketch appears after this list.
A curated reading list of research in Adaptive Computation, Inference-Time Computation & Mixture of Experts (MoE).
Custom Python Scheduler for Kubernetes (a toy binding-loop sketch appears after this list).
⏰ Collaboratively track worldwide conference deadlines (website, Python CLI, WeChat applet). If you find it useful, please star the project.
📄 Paper reading notes on distributed systems, virtualization, and machine learning.
Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
GPU-scheduler-for-deep-learning
Tools for monitoring NVIDIA GPUs on Linux (a pynvml polling sketch appears after this list).
GPU Sharing Scheduler for Kubernetes Cluster
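
For the loralib entry above: the underlying idea is to freeze the pretrained weight W and learn only a low-rank update scaled by alpha/r. Below is a minimal NumPy sketch of that idea, not loralib's actual API; the class name `LoRALinear` and all shapes are illustrative.

```python
# Minimal LoRA-style linear layer sketch in NumPy (illustrative, not loralib's API).
import numpy as np

class LoRALinear:
    """Frozen weight W plus a trainable low-rank update (alpha/r) * B @ A."""
    def __init__(self, d_in, d_out, r=8, alpha=16, seed=0):
        rng = np.random.default_rng(seed)
        self.W = rng.standard_normal((d_out, d_in)) / np.sqrt(d_in)  # frozen pretrained weight
        self.A = rng.standard_normal((r, d_in)) * 0.01               # trainable, small random init
        self.B = np.zeros((d_out, r))                                # trainable, zero init: no change at start
        self.scale = alpha / r

    def forward(self, x):
        # x: (batch, d_in) -> (batch, d_out); only A and B would receive gradients.
        return x @ self.W.T + self.scale * (x @ self.A.T) @ self.B.T

layer = LoRALinear(d_in=512, d_out=512)
print(layer.forward(np.ones((2, 512))).shape)  # (2, 512)
```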
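For the two MoE reading lists: the core mechanism those papers study is a gate that routes each token to its top-k experts and mixes their outputs with softmax weights. A toy NumPy sketch under those assumptions; every name here is illustrative.

```python
# Toy top-k Mixture-of-Experts routing sketch in NumPy (illustrative only).
import numpy as np

def moe_forward(x, gate_W, experts, k=2):
    """Route each token to its top-k experts, mix outputs by softmax gate weights."""
    logits = x @ gate_W                          # (tokens, n_experts) routing scores
    topk = np.argsort(logits, axis=-1)[:, -k:]   # indices of the k best experts per token
    out = np.zeros_like(x)
    for t in range(x.shape[0]):
        sel = topk[t]
        w = np.exp(logits[t, sel]); w /= w.sum() # softmax over the selected experts only
        for j, e in enumerate(sel):
            out[t] += w[j] * experts[e](x[t])    # weighted mix of expert outputs
    return out

rng = np.random.default_rng(0)
d, n_exp = 16, 4
experts = [lambda v, W=rng.standard_normal((d, d)) / 4: v @ W for _ in range(n_exp)]
y = moe_forward(rng.standard_normal((8, d)), rng.standard_normal((d, n_exp)), experts)
print(y.shape)  # (8, 16)
```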
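For the custom Kubernetes scheduler entry: the usual shape of such a scheduler is a loop that watches for Pending pods carrying a matching `schedulerName` and binds each to a node. A toy sketch using the official `kubernetes` Python client; the scheduler name `toy-scheduler` and the random placement policy are my assumptions, not the repo's logic.

```python
# Toy Kubernetes scheduler sketch using the official `kubernetes` Python client.
# Assumes pods are created with `schedulerName: toy-scheduler` in their spec.
import random
from kubernetes import client, config, watch

config.load_kube_config()  # use config.load_incluster_config() when running in-cluster
v1 = client.CoreV1Api()

def bind(pod_name, node_name, namespace="default"):
    body = client.V1Binding(
        metadata=client.V1ObjectMeta(name=pod_name),
        target=client.V1ObjectReference(api_version="v1", kind="Node", name=node_name),
    )
    # _preload_content=False sidesteps a known client deserialization quirk on this call.
    v1.create_namespaced_binding(namespace, body, _preload_content=False)

w = watch.Watch()
for event in w.stream(v1.list_namespaced_pod, namespace="default"):
    pod = event["object"]
    if pod.status.phase == "Pending" and pod.spec.scheduler_name == "toy-scheduler":
        nodes = [n.metadata.name for n in v1.list_node().items]
        bind(pod.metadata.name, random.choice(nodes))  # naive policy: pick a random node
```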
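For the NVIDIA GPU monitoring entry: the common path on Linux is NVML, exposed to Python via the `pynvml` bindings (pip package `nvidia-ml-py`). A small polling sketch; the output format is mine, and only standard NVML calls are used.

```python
# Small GPU polling sketch over NVML via pynvml (pip install nvidia-ml-py).
import pynvml

pynvml.nvmlInit()
try:
    for i in range(pynvml.nvmlDeviceGetCount()):
        h = pynvml.nvmlDeviceGetHandleByIndex(i)
        name = pynvml.nvmlDeviceGetName(h)             # bytes on older pynvml versions
        util = pynvml.nvmlDeviceGetUtilizationRates(h)  # .gpu / .memory in percent
        mem = pynvml.nvmlDeviceGetMemoryInfo(h)         # .used / .total in bytes
        print(f"GPU{i} {name}: util={util.gpu}% "
              f"mem={mem.used / 2**20:.0f}/{mem.total / 2**20:.0f} MiB")
finally:
    pynvml.nvmlShutdown()
```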