Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and Adaptive Quantization"

Jupyter Notebook 39 3 Updated Aug 29, 2025

hao-ai-lab / Consistency_LLM

[ICML 2024] CLLMs: Consistency Large Language Models

Python 416 23 Updated Nov 16, 2024

xai-org / grok-1

Grok open release

Python 51,692 8,472 Updated Aug 30, 2024

awslabs / optimizing-multitask-training-through-dynamic-pipelines

Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines

Python 19 2 Updated Dec 8, 2023

JIA-Lab-research / Prompt-Highlighter

[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs

Python 158 4 Updated Jul 23, 2024

Doubiiu / DynamiCrafter

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 3,001 245 Updated Sep 8, 2024

jasperzhong / GNNFlow

Distributed Deep Graph Learning Framework for Dynamic Graphs

Python 19 3 Updated Mar 25, 2024

S-LoRA / S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,913 124 Updated Jan 21, 2024

JIA-Lab-research / RIVAL

[NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain

Python 154 9 Updated Jan 2, 2024

NVIDIA / TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,933 2,486 Updated Jun 22, 2026

SymbioticLab / Oobleck

A resilient distributed training framework

Python 100 12 Updated Apr 11, 2024

alibaba / BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

C++ 929 168 Updated Dec 30, 2024

FasterDecoding / Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,749 201 Updated Jun 25, 2024

plasma-umass / scalene

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Python 13,455 437 Updated Jun 21, 2026

Yutong-Zhou-cv / Awesome-Multimodality

A Survey on multimodal learning research.

332 22 Updated Aug 22, 2023

X-PLUG / mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,542 190 Updated Apr 2, 2025

raywan-110 / AdaQP

Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training

Python 24 3 Updated Mar 1, 2024

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python 4,107 321 Updated Aug 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chenyu Jiang chenyu-jiang

Achievements

Achievements

Highlights

Block or report chenyu-jiang

Stars

ALAGENT-HKU / x2strategy

ChandlerGuan / mercury_artifact

meituan-longcat / LongCat-Flash-Chat

jhpoelen / zenodo-upload

SJTU-IPADS / PhoenixOS

ByteDance-Seed / ByteCheckpoint

deepseek-ai / open-infra-index

verl-project / verl

cchan / tccl

microsoft / BitBLAS

JoeyYoung / adapcc

shawntan / scattermoe

tonyzhao-jt / LLM-PQ