nbei

🎯

One more chance

Rui Xu nbei

🎯

One more chance

PhD in MMLab, CUHK

221 followers · 14 following

nbei.github.io

Achievements

x2 x3

Achievements

x2 x3

Stars

NVIDIA / dl-lowlat-infer

Low Latency inference for sliding window LSTMs

Cuda 10 2 Updated Apr 10, 2026

tiann / hapi

App for Claude Code / Codex / Gemini / OpenCode, vibe coding anytime, anywhere

TypeScript 3,798 415 Updated Apr 29, 2026

NVIDIA / DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,683 662 Updated Apr 30, 2026

google-deepmind / acme

A library of reinforcement learning components and agents

Python 3,979 535 Updated Apr 8, 2026

sail-sg / envpool

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

C++ 1,330 133 Updated Apr 30, 2026

google-research / seed_rl

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

Python 837 145 Updated Nov 29, 2022

thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

Python 10,618 1,300 Updated Apr 3, 2026

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 42,384 7,517 Updated Apr 30, 2026

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 1,586 358 Updated Apr 30, 2026

deepseek-ai / LPLB

An early research stage expert-parallel load balancer for MoE models based on linear programming.

Python 502 34 Updated Nov 19, 2025

DLR-RM / rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python 2,786 595 Updated Apr 23, 2026

DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 13,189 2,117 Updated Apr 19, 2026

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,032 3,777 Updated Apr 30, 2026

XinJingHao / DRL-Pytorch

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Python 3,377 390 Updated Jun 11, 2025

facebookresearch / fairseq2

FAIR Sequence Modeling Toolkit 2

Python 1,128 140 Updated Apr 27, 2026

MilesCranmer / PySR

High-Performance Symbolic Regression in Python and Julia

Python 3,526 326 Updated Apr 27, 2026

XueFuzhao / awesome-mixture-of-experts

A collection of AWESOME things about mixture-of-experts

1,276 87 Updated Dec 8, 2024

junfanz1 / MoE-Mixture-of-Experts-in-PyTorch

Implementations of a Mixture-of-Experts (MoE) architecture designed for research on large language models (LLMs) and scalable neural network designs. One implementation targets a **single-device/NP…

Python 68 7 Updated Apr 8, 2025

deepseek-ai / DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,921 306 Updated Jan 16, 2024

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 5,286 802 Updated Apr 30, 2026

End2End-Diffusion / REPA-E

[ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers

Python 489 29 Updated Dec 6, 2025

facebookresearch / xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,439 773 Updated Apr 21, 2026

alibaba / Megatron-LLaMA

Forked from NVIDIA/Megatron-LM

Best practice for training LLaMA models in Megatron-LM

Python 664 57 Updated Jan 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rui Xu nbei

Achievements

Achievements

Block or report nbei

Stars

NVIDIA / dl-lowlat-infer

tiann / hapi

NVIDIA / DALI

google-deepmind / acme

sail-sg / envpool

google-research / seed_rl

thu-ml / tianshou

ray-project / ray

NVIDIA-NeMo / RL

deepseek-ai / LPLB

DLR-RM / rl-baselines3-zoo

DLR-RM / stable-baselines3

verl-project / verl

XinJingHao / DRL-Pytorch

facebookresearch / fairseq2

MilesCranmer / PySR

XueFuzhao / awesome-mixture-of-experts

junfanz1 / MoE-Mixture-of-Experts-in-PyTorch

deepseek-ai / DeepSeek-MoE

pytorch / torchtitan

End2End-Diffusion / REPA-E

facebookresearch / xformers

alibaba / Megatron-LLaMA

NVIDIA-NeMo / NeMo

Dao-AILab / causal-conv1d

thuml / Time-Series-Library

fla-org / flash-linear-attention

SandAI-org / MagiAttention

SandAI-org / MAGI-1

carefree0910 / carefree-pyo3