Skip to content
View ymjiang's full-sized avatar

Organizations

@bytedance @dmlc

Block or report ymjiang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficient Long-context Language Model Training by Core Attention Disaggregation

Python 98 7 Updated Apr 7, 2026
C++ 360 40 Updated Jan 28, 2026

[ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo

Python 69 9 Updated Mar 11, 2026

Official Repo for Open-Reasoner-Zero

Python 2,091 119 Updated Jun 2, 2025

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,908 123 Updated Jan 21, 2024

Microsoft Automatic Mixed Precision Library

Python 636 50 Updated Dec 1, 2025

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,723 197 Updated Jun 25, 2024

Fast and memory-efficient exact attention

Python 23,379 2,615 Updated Apr 16, 2026

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,744 484 Updated Jan 8, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,726 9,704 Updated Nov 12, 2025
Python 243 32 Updated Nov 9, 2022

Pipeline Parallelism for PyTorch

Python 786 87 Updated Aug 21, 2024
Python 220 25 Updated Aug 17, 2023

Intel® Performance Counter Monitor (Intel® PCM)

C++ 3,258 523 Updated Apr 13, 2026

A multi-party collaborative machine learning framework

Python 903 175 Updated Feb 20, 2026

Ongoing research training transformer models at scale

Python 16,055 3,833 Updated Apr 16, 2026

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 9,683 2,095 Updated Apr 16, 2024

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

C++ 4 4 Updated Nov 9, 2019

A lightweight parameter server interface

C++ 88 27 Updated Jan 13, 2023

A high performance and generic framework for distributed DNN training

Python 3,715 492 Updated Oct 3, 2023

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,691 2,247 Updated Dec 1, 2025