nbei
🎯
One more chance

Low Latency inference for sliding window LSTMs

Cuda 10 2 Updated Apr 10, 2026

App for Claude Code / Codex / Gemini / OpenCode, vibe coding anytime, anywhere

TypeScript 3,798 415 Updated Apr 29, 2026

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,683 662 Updated Apr 30, 2026

A library of reinforcement learning components and agents

Python 3,979 535 Updated Apr 8, 2026

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

C++ 1,330 133 Updated Apr 30, 2026

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

Python 837 145 Updated Nov 29, 2022

An elegant PyTorch deep reinforcement learning library.

Python 10,618 1,300 Updated Apr 3, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 42,384 7,517 Updated Apr 30, 2026

Scalable toolkit for efficient model reinforcement

Python 1,586 358 Updated Apr 30, 2026

An early research stage expert-parallel load balancer for MoE models based on linear programming.

Python 502 34 Updated Nov 19, 2025

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python 2,786 595 Updated Apr 23, 2026

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 13,189 2,117 Updated Apr 19, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,032 3,777 Updated Apr 30, 2026

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Python 3,377 390 Updated Jun 11, 2025

FAIR Sequence Modeling Toolkit 2

Python 1,128 140 Updated Apr 27, 2026

High-Performance Symbolic Regression in Python and Julia

Python 3,526 326 Updated Apr 27, 2026

A collection of AWESOME things about mixture-of-experts

1,276 87 Updated Dec 8, 2024

Implementations of a Mixture-of-Experts (MoE) architecture designed for research on large language models (LLMs) and scalable neural network designs. One implementation targets a **single-device/NP…

Python 68 7 Updated Apr 8, 2025

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,921 306 Updated Jan 16, 2024

A PyTorch native platform for training generative AI models

Python 5,286 802 Updated Apr 30, 2026

[ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers

Python 489 29 Updated Dec 6, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,439 773 Updated Apr 21, 2026

Best practice for training LLaMA models in Megatron-LM

Python 664 57 Updated Jan 2, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 17,152 3,401 Updated Apr 30, 2026

Causal depthwise conv1d in CUDA, with a PyTorch interface

Cuda 857 179 Updated Mar 10, 2026

A Library for Advanced Deep Time Series Models for General Time Series Analysis.

Python 12,169 1,925 Updated Apr 18, 2026

🚀 Efficient implementations for emerging model architectures

Python 5,016 515 Updated Apr 30, 2026

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 796 52 Updated Apr 21, 2026

MAGI-1: Autoregressive Video Generation at Scale

Python 3,683 237 Updated Jun 17, 2025
Rust 1 Updated Jun 26, 2025