Stars
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
MSCCL++: A GPU-driven communication stack for scalable AI applications
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
NVIDIA Resiliency Extension is a Python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to failures.
A PyTorch native platform for training generative AI models
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization.
Efficient Triton Kernels for LLM Training
Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
LLM training code for Databricks foundation models
Implementation of "SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks"
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it combines the best of RNN and transformer.
General technology for enabling AI capabilities with LLMs and MLLMs
🦜🔗 Build context-aware reasoning applications
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Code and documentation to train Stanford's Alpaca models, and generate the data.
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
Visualizer for neural network, deep learning and machine learning models
Library for reading and writing large multi-dimensional arrays.
NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.