Skip to content
View hengyuan-hu's full-sized avatar

Block or report hengyuan-hu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 1 Updated Oct 3, 2025
Python 72 10 Updated Apr 8, 2026

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Python 1,427 166 Updated Apr 17, 2025
Python 39 6 Updated May 13, 2026

Public release for paper Policy Learning with a Language Bottleneck (TMLR 2026, RLC TAFM Spotlight).

Python 5 Updated Mar 3, 2026
Python 73 10 Updated Sep 23, 2024

Pyrallis is a framework for structured configuration parsing from both cmd and files. Simply define your desired configuration structure as a dataclass and let pyrallis do the rest!

Python 257 6 Updated Mar 1, 2026
Python 13 Updated Feb 25, 2025
Python 16 2 Updated Feb 23, 2024

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 13,003 1,239 Updated Jun 3, 2026

Code for magnetic mirror descent.

Python 20 4 Updated Oct 5, 2023

Inference code for Llama models

Python 59,461 9,790 Updated Jan 26, 2025

Karabiner-Elements is a powerful tool for customizing keyboards on macOS

C++ 22,354 914 Updated Jun 21, 2026

Development repository for the Triton language and compiler

MLIR 19,496 2,952 Updated Jun 22, 2026

A data-driven, fast driving simulator for multi-agent coordination under partial observability.

Python 300 34 Updated Jun 18, 2024

A library for distributed ML training with PyTorch

C++ 365 22 Updated Dec 12, 2022

Implementation of the Off Belief Learning algorithm.

Python 49 8 Updated Aug 18, 2022

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

Python 519 68 Updated Feb 12, 2025

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 3,196 343 Updated May 18, 2026

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 3,018 165 Updated Jul 9, 2025

Python 3.8+ toolbox for submitting jobs to Slurm

Python 1,623 149 Updated Jan 14, 2026

An example bot for the Hanabi Live website written in Python

Python 5 9 Updated Apr 29, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 100,951 28,082 Updated Jun 22, 2026

The project is a platform of zero learning with a library of games.

C++ 266 55 Updated Oct 12, 2021

Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning

Python 102 33 Updated Jun 22, 2022

Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments

C++ 1,167 221 Updated Mar 27, 2026

Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it

C++ 129 29 Updated Jul 18, 2023

Reinforcement Learning Assembly

C++ 94 8 Updated Sep 2, 2021

A PyTorch Platform for Distributed RL

Python 754 113 Updated Sep 15, 2021
Next