Skip to content
View nzw0301's full-sized avatar

Organizations

@apache @optuna

Block or report nzw0301

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 2 Updated Feb 10, 2026
Python 229 9 Updated Mar 9, 2026

Accelerating RL for LLM Reasoning with Optimal Advantage Regression

Python 40 1 Updated May 30, 2025
Python 12 1 Updated Mar 12, 2026

Modular, scalable library to train ML models

Python 228 24 Updated Mar 26, 2026

Accelerating MoE with IO and Tile-aware Optimizations

Python 614 67 Updated Mar 24, 2026

MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI training and inference, such as FP8 row-wise quantization and …

Python 87 35 Updated Mar 26, 2026

[Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Python 193 20 Updated Jan 12, 2026

Async RL Training at Scale

Python 1,182 239 Updated Mar 26, 2026

Kernel sources for https://huggingface.co/kernels-community

C++ 83 26 Updated Mar 26, 2026

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 384 40 Updated Mar 25, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,224 3,502 Updated Mar 26, 2026
Python 327 32 Updated Jul 25, 2024

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 347 40 Updated Mar 24, 2026

Train transformer language models with reinforcement learning.

Python 17,795 2,589 Updated Mar 26, 2026

A benchmark to evaluate search-augmented LLMs

Python 17 2 Updated Aug 28, 2025

Scalable data pre processing and curation toolkit for LLMs

Python 1,485 245 Updated Mar 26, 2026
Python 1 Updated Oct 15, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,927 790 Updated Mar 11, 2026

Tool for generating high quality Synthetic datasets

Python 1,541 217 Updated Oct 28, 2025
Python 134 28 Updated Jan 22, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,948 2,064 Updated Mar 26, 2026

Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings

Python 117 18 Updated Jul 27, 2025
Python 167 7 Updated Aug 18, 2025

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.

TypeScript 41,249 2,627 Updated Mar 26, 2026
Python 116 19 Updated Jan 4, 2026

Implementation for our COLM paper "Off-Policy Corrected Reward Modeling for RLHF"

Python 8 Updated Jul 23, 2025

An action for automatically labelling pull requests

TypeScript 2,412 478 Updated Mar 25, 2026
Next