Skip to content
View 2sin18's full-sized avatar
  • Alibaba Group
  • Beijing

Block or report 2sin18

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,199 71 Updated Oct 8, 2025

A Quirky Assortment of CuTe Kernels

Python 612 48 Updated Oct 9, 2025

Nano vLLM

Python 7,007 891 Updated Aug 31, 2025

MSCCL++: A GPU-driven communication stack for scalable AI applications

C++ 417 68 Updated Oct 9, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,920 285 Updated May 15, 2025

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…

Python 226 34 Updated Oct 7, 2025

LLM inference in C/C++

C++ 87,400 13,262 Updated Oct 9, 2025

A PyTorch native platform for training generative AI models

Python 4,512 557 Updated Oct 9, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,766 516 Updated Oct 9, 2025

Efficient Triton Kernels for LLM Training

Python 5,725 413 Updated Oct 8, 2025

Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.

134 5 Updated Aug 21, 2024

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 124 77 Updated May 29, 2025

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,803 292 Updated Jan 16, 2024
Python 1,460 216 Updated Jun 26, 2025

Mamba SSM architecture

Python 16,020 1,461 Updated Oct 8, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,470 1,623 Updated Sep 30, 2025

LLM training code for Databricks foundation models

Python 4,335 578 Updated Oct 6, 2025

Implementation of "SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks"

Python 852 88 Updated Jul 21, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,006 952 Updated Sep 27, 2025

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,147 338 Updated Jun 30, 2025

🦜🔗 Build context-aware reasoning applications

Python 116,895 19,235 Updated Oct 9, 2025

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,800 1,154 Updated Jun 30, 2023

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,191 1,210 Updated Oct 8, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,168 4,039 Updated Jul 17, 2024

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,787 145 Updated Jun 17, 2025

Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

Python 2,049 129 Updated Jul 22, 2024

Visualizer for neural network, deep learning and machine learning models

JavaScript 31,522 3,006 Updated Oct 9, 2025

Library for reading and writing large multi-dimensional arrays.

C++ 1,451 131 Updated Oct 7, 2025

NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.

Python 1,100 144 Updated Sep 15, 2025
Next