Skip to content
View nzw0301's full-sized avatar

Organizations

@apache @optuna

Block or report nzw0301

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 2 Updated Feb 10, 2026
Python 222 9 Updated Mar 9, 2026

Accelerating RL for LLM Reasoning with Optimal Advantage Regression

Python 40 1 Updated May 30, 2025
Python 12 1 Updated Mar 12, 2026

Modular, scalable library to train ML models

Python 227 24 Updated Mar 20, 2026

Accelerating MoE with IO and Tile-aware Optimizations

Python 612 65 Updated Mar 17, 2026

MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI training and inference, such as FP8 row-wise quantization and …

Python 87 33 Updated Mar 21, 2026

[Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Python 191 20 Updated Jan 12, 2026

Async RL Training at Scale

Python 1,166 234 Updated Mar 22, 2026

Kernel sources for https://huggingface.co/kernels-community

C++ 81 26 Updated Mar 20, 2026

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 382 39 Updated Mar 20, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,097 3,476 Updated Mar 21, 2026
Python 327 32 Updated Jul 25, 2024

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 344 40 Updated Mar 21, 2026

Train transformer language models with reinforcement learning.

Python 17,742 2,576 Updated Mar 21, 2026

A benchmark to evaluate search-augmented LLMs

Python 17 2 Updated Aug 28, 2025

Scalable data pre processing and curation toolkit for LLMs

Python 1,464 239 Updated Mar 22, 2026
Python 1 Updated Oct 15, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,891 779 Updated Mar 11, 2026

Tool for generating high quality Synthetic datasets

Python 1,540 215 Updated Oct 28, 2025
Python 134 27 Updated Jan 22, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,930 2,064 Updated Jan 13, 2026

Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings

Python 117 18 Updated Jul 27, 2025
Python 167 7 Updated Aug 18, 2025

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.

TypeScript 41,185 2,612 Updated Mar 22, 2026
Python 115 19 Updated Jan 4, 2026

Implementation for our COLM paper "Off-Policy Corrected Reward Modeling for RLHF"

Python 8 Updated Jul 23, 2025

An action for automatically labelling pull requests

TypeScript 2,407 478 Updated Mar 20, 2026
Next