Skip to content
View ymjiang's full-sized avatar

Organizations

@bytedance @dmlc

Block or report ymjiang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficient Long-context Language Model Training by Core Attention Disaggregation

Python 97 7 Updated Apr 1, 2026
C++ 358 40 Updated Jan 28, 2026

[ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo

Python 68 9 Updated Mar 11, 2026

Official Repo for Open-Reasoner-Zero

Python 2,087 119 Updated Jun 2, 2025

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,905 123 Updated Jan 21, 2024

Microsoft Automatic Mixed Precision Library

Python 635 50 Updated Dec 1, 2025

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,720 197 Updated Jun 25, 2024

Fast and memory-efficient exact attention

Python 23,136 2,583 Updated Apr 4, 2026

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,743 484 Updated Jan 8, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,063 9,551 Updated Nov 12, 2025
Python 243 32 Updated Nov 9, 2022

Pipeline Parallelism for PyTorch

Python 785 88 Updated Aug 21, 2024
Python 220 25 Updated Aug 17, 2023

Intel® Performance Counter Monitor (Intel® PCM)

C++ 3,254 524 Updated Apr 1, 2026

A multi-party collaborative machine learning framework

Python 903 174 Updated Feb 20, 2026

Ongoing research training transformer models at scale

Python 15,912 3,784 Updated Apr 4, 2026

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 9,669 2,098 Updated Apr 16, 2024

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

C++ 4 4 Updated Nov 9, 2019

A lightweight parameter server interface

C++ 88 27 Updated Jan 13, 2023

A high performance and generic framework for distributed DNN training

Python 3,715 494 Updated Oct 3, 2023

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,685 2,248 Updated Dec 1, 2025