conglongli

Organizations

@efficient @google-deepmind @viscloud @llm-jp

Latency and Memory Analysis of Transformer Models for Training and Inference

Python, 485 stars, 57 forks, updated Apr 19, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python, 2,241 stars, 370 forks, updated Aug 14, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python, 42,028 stars, 4,780 forks, updated Apr 10, 2026

Example models using DeepSpeed

Python, 6,815 stars, 1,122 forks, updated Mar 30, 2026

Source code for SIGMOD 2020 paper "Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination"

C++, 60 stars, 12 forks, updated Jul 17, 2020