Skip to content
View 0Pinky0's full-sized avatar

Block or report 0Pinky0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

NeurIPS 2025: Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning

Python 2 Updated Oct 13, 2025

A Really Scalable RL Framework to 10k+ CPUs

Python 36 2 Updated Feb 29, 2024

Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication

Python 263 16 Updated Oct 22, 2025

SoftVC VITS Singing Voice Conversion

Python 27,741 5,069 Updated Nov 11, 2023

A Survey of Reinforcement Learning for Large Reasoning Models

1,987 111 Updated Nov 5, 2025
Jupyter Notebook 919 108 Updated Jun 27, 2024

Clash官网各版本Clash下载地址及备份下载地址

68 3 Updated Mar 30, 2025

Code release for ConvNeXt model

Python 6,178 725 Updated Jan 8, 2023
Jupyter Notebook 243 23 Updated Feb 18, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,382 1,367 Updated Jul 9, 2025

Writing AI Conference Papers: A Handbook for Beginners

2,967 104 Updated Jul 16, 2025

使用obs批量录制视频

Python 8 Updated Oct 11, 2024

Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/

Python 1,618 79 Updated Oct 3, 2025

Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with support for Gymnasium/Gym, NVIDIA Isaac Lab, Brax and other environments

Python 895 113 Updated Oct 22, 2025

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

1,976 166 Updated Aug 26, 2025

[ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis

Python 11 Updated Apr 27, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

66,645 7,514 Updated Jun 4, 2025

臸娥粂陆亩竟

JavaScript 10 2 Updated May 11, 2024

Windows Subsystem for Linux

C++ 30,269 1,508 Updated Nov 5, 2025

A deep reinforcement learning agent learning to play the classical game of Tetris.

Python 4 1 Updated Dec 12, 2020

Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).

Python 158 12 Updated Feb 18, 2025

一款自动为你的色图进行哈基米马赛克处理的打码器😎再也不用担心家里请不到高人了|自动哈基米打码器

Python 277 15 Updated Jun 18, 2025

An reconstruction of RL Introduction and its course materials for a more efficient entry

16 3 Updated May 23, 2025

A webpage centered around Rain World and its many threat themes.

JavaScript 16 6 Updated Nov 2, 2025

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary hyperparameter optimization.

Python 840 66 Updated Nov 4, 2025

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 654 63 Updated Apr 20, 2025

An extension of the PyMARL codebase that includes additional algorithms and environment support

Python 646 169 Updated Sep 24, 2024

This package contains the original 2012 AlexNet code.

Cuda 2,762 356 Updated Mar 12, 2025

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 366 42 Updated Oct 29, 2025
Next