Highlights
Lists (15)
Sort Name ascending (A-Z)
Stars
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
使用盲水印保护创作者的知识产权using invisible watermark to protect creator's intellectual property
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
Imitation learning benchmark focusing on complex locomotion tasks using MuJoCo.
The official implementation of Self-Play Fine-Tuning (SPIN)
SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
This is an vault template for researchers using obsidian.
BEHAVIOR-1K: a platform for accelerating Embodied AI research. Join our Discord for support: https://discord.gg/bccR5vGFEx
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
Benchmarking Generalized Out-of-Distribution Detection
🚀 大语言模型高效转发服务 · An efficient forwarding service designed for LLMs. · OpenAI API Reverse Proxy
A Light CNN for Deep Face Representation with Noisy Labels, TIFS 2018
High throughput synchronous and asynchronous reinforcement learning
Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with support for Gymnasium/Gym, NVIDIA Isaac Lab, Brax and other environments
This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
[CoRL 2024] HumanPlus: Humanoid Shadowing and Imitation from Humans
A suite of test scenarios for multi-agent reinforcement learning.
Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
Class-Balanced Loss Based on Effective Number of Samples. CVPR 2019
Optimizing AlphaFold Training and Inference on GPU Clusters
Create, manipulate and convert representations of position and orientation in 2D or 3D using Python
A tool for enriching the output of nvidia-smi.