DachengLi1

7 results for forked starred repositories

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,186 365 Updated Aug 14, 2025

Fast and memory-efficient exact attention

Python 198 67 Updated Oct 20, 2025

Transformers at any scale

Python 41 1 Updated Jan 18, 2024

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Python 2 Updated Sep 20, 2021

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyp…

Python 3 1 Updated Jan 21, 2023

PyTorch differentiable Multi-Scale Structural Similarity (MS-SSIM) loss

Python 461 68 Updated Aug 15, 2025
JavaScript 1 Updated Oct 20, 2019