Skip to content
View hr0nix's full-sized avatar

Block or report hr0nix

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Support VLAs e.g., Pi0, Pi0.5, GR00TN1.5. Fully open-sourced.

Python 332 32 Updated Apr 24, 2026

A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.

Python 167 9 Updated Nov 11, 2025

Structured Outputs

Python 13,954 710 Updated May 18, 2026
Python 166 22 Updated Dec 13, 2023

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Python 1,038 44 Updated May 31, 2024

Single-line inference of SOTA deep learning models

Python 28 2 Updated Jan 22, 2023

LoRA for arbitrary JAX models and functions

Python 144 6 Updated Feb 26, 2024

If it quacks like a tensor...

Python 59 5 Updated Nov 13, 2024

Library for reading and processing ML training data.

Python 745 81 Updated Jun 13, 2026

jax-triton contains integrations between JAX and OpenAI Triton

Python 462 57 Updated Jun 1, 2026

Infer.NET is a framework for running Bayesian inference in graphical models

C# 1,614 239 Updated Dec 8, 2025

YTsaurus is a scalable and fault-tolerant open-source big data platform.

C++ 2,195 207 Updated Jun 13, 2026
Python 110 7 Updated Jun 25, 2024

The agent engineering platform.

Python 139,206 23,077 Updated Jun 13, 2026

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,688 2,988 Updated Apr 14, 2026

Orbax provides common checkpointing and persistence utilities for JAX users

Python 518 98 Updated Jun 12, 2026

Nethack Learning Environment Wrapper for Language Interface

Python 42 9 Updated Sep 11, 2023
Python 944 69 Updated Jun 12, 2026

A toolkit for developing and comparing reinforcement learning algorithms.

Python 37,223 8,704 Updated Mar 26, 2026

Development repository for the Triton language and compiler

MLIR 19,434 2,937 Updated Jun 13, 2026

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 7,236 818 Updated Jun 8, 2026

High-quality implementations of standard and SOTA methods on a variety of tasks.

Python 1,576 220 Updated Jun 12, 2026

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 5,894 511 Updated Jun 8, 2026

Pyrallis is a framework for structured configuration parsing from both cmd and files. Simply define your desired configuration structure as a dataclass and let pyrallis do the rest!

Python 255 6 Updated Mar 1, 2026

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 14,818 3,407 Updated Aug 12, 2024
Python 1,428 101 Updated Jun 12, 2026

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

Python 518 67 Updated Feb 12, 2025

The NetHack Learning Environment

C 983 130 Updated May 6, 2024

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 35,814 3,634 Updated Jun 13, 2026

Monte Carlo tree search in JAX

Python 2,631 209 Updated Jun 12, 2026
Next