Stars
AI agents running research on single-GPU nanochat training automatically
Constraint Satisfaction Problem Solver for Golang
Unity SDK for Radar, the leading geofencing and location tracking platform
Tzafon-WayPoint is a robust, scalable solution for managing large fleets of browser instances. WayPoint stands out with unmatched cold‑start speed, launching up to 1,000 browsers per second on stand…
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
A library of reinforcement learning components and agents
Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforce…
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
Tensors and Dynamic neural networks in Python with strong GPU acceleration
PyTorch implementation of DQN, DDQN, prioritized replay, noisy networks, distributional values, Rainbow, and hierarchical RL
behaviac is a framework for game AI development that can also be used as a rapid game prototyping tool. behaviac supports behavior trees, finite state machines, and hierarchical task ne…
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.
Biological foundation modeling from molecular to genome scale
SP1 is a zero‑knowledge virtual machine that proves the correct execution of programs compiled for the RISC-V architecture.
A state-of-the-art open visual language model | multimodal pretrained model
Tutorials, tools, and more as related to reverse engineering video games.
luban is a powerful, easy-to-use, elegant, and stable game configuration solution.