181 results for source starred repositories

NeurIPS 2025: Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning

Python 2 Updated Oct 13, 2025

A Really Scalable RL Framework to 10k+ CPUs

Python 37 2 Updated Feb 29, 2024

Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication

Python 270 16 Updated Oct 22, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

1,995 111 Updated Nov 5, 2025

Download links and backup download links for every Clash version from the official Clash site

68 3 Updated Mar 30, 2025
Jupyter Notebook 243 23 Updated Feb 18, 2025

The true story of the hardship and darkness behind the development of the Noah Pangu large model.

11,382 1,367 Updated Jul 9, 2025

Writing AI Conference Papers: A Handbook for Beginners

2,970 103 Updated Jul 16, 2025

Batch-record videos using OBS

Python 8 Updated Oct 11, 2024

Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/

Python 1,622 80 Updated Oct 3, 2025

Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with support for Gymnasium/Gym, NVIDIA Isaac Lab, Brax and other environments

Python 897 113 Updated Oct 22, 2025

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

1,977 167 Updated Aug 26, 2025

[ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis

Python 11 Updated Apr 27, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

66,700 7,520 Updated Jun 4, 2025

臸娥粂陆亩竟

JavaScript 10 2 Updated May 11, 2024

Windows Subsystem for Linux

C++ 30,276 1,509 Updated Nov 6, 2025

A deep reinforcement learning agent learning to play the classical game of Tetris.

Python 4 1 Updated Dec 12, 2020

Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).

Python 159 12 Updated Feb 18, 2025

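A censoring tool that automatically applies Hachimi mosaics to your NSFW images 😎 No more worrying about not being able to find an expert at home | Automatic Hachimi censoring tool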

Python 278 15 Updated Jun 18, 2025

A reconstruction of RL Introduction and its course materials for a more efficient entry point

16 3 Updated May 23, 2025

A webpage centered around Rain World and its many threat themes.

JavaScript 16 6 Updated Nov 2, 2025

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary hyperparameter optimization.

Python 841 67 Updated Nov 4, 2025

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 654 63 Updated Apr 20, 2025

An extension of the PyMARL codebase that includes additional algorithms and environment support

Python 647 169 Updated Sep 24, 2024

This package contains the original 2012 AlexNet code.

Cuda 2,765 358 Updated Mar 12, 2025

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 368 42 Updated Oct 29, 2025

麦麦bot, a multi-platform agent and "cyber netizen" focused on group chat (fairly focused)

Python 3,532 382 Updated Nov 6, 2025

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 10,563 1,177 Updated Nov 6, 2025

Lumina Robotics Talent Call | Lumina community Embodied AI talent board | A list for Embodied AI / Robotics Jobs (PhD, RA, intern, full-time, etc.)

1,042 22 Updated Oct 27, 2025