Stars
A complete computer science study plan to become a software engineer.
Repo for counting stars and contributing. Press F to pay respects to glorious developers.
Curated list of project-based tutorials
Virtual whiteboard for sketching hand-drawn like diagrams
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Deep Learning 500 Questions: a Q&A-style treatment of common topics in probability, linear algebra, machine learning, deep learning, computer vision, and other hot areas, written to help the author and any interested readers. The book spans 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are warmly invited to point out errors. To be continued... For collaboration inquiries, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan 2018.06
Making large AI models cheaper, faster and more accessible
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
An interactive git visualization and tutorial. Aspiring students of git can use this app to educate and challenge themselves towards mastery of git!
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A playbook for systematically maximizing the performance of deep learning models.
Oh my tmux! My self-contained, pretty & versatile tmux configuration made with 💛🩷💙🖤❤️🤍
Fast and memory-efficient exact attention
wangEditor, an open-source Web rich text editor
Development repository for the Triton language and compiler
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Ongoing research training transformer models at scale
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"