Kun0913
  • Beihang University
  • Beijing, China

[TNNLS-2024, arXiv-2023.2.10] Official repository of "A Survey on Causal Reinforcement Learning"

179 4 Updated Dec 8, 2025

Paper list of multi-agent reinforcement learning (MARL)

4,638 764 Updated Nov 19, 2025

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL lets you quickly compare different MARL algorithms, tasks, and models while being systematically ground…

Python 535 109 Updated Nov 10, 2025

Paper Debugger is the best Overleaf companion

TypeScript 1,140 51 Updated Dec 17, 2025

A Next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…

TypeScript 12,681 1,268 Updated Dec 17, 2025

Accelerated Quality-Diversity

Python 332 56 Updated Oct 30, 2025

Multi-Agent Reinforcement Learning with JAX

Python 705 134 Updated Dec 9, 2025

DeepSeek Coder: Let the Code Write Itself

Python 22,504 2,682 Updated Nov 11, 2025

Multi/single UAV (unmanned aerial vehicle) path planning based on deep reinforcement learning

Python 604 44 Updated Aug 27, 2025

A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning

Python 6 Updated Nov 24, 2025

Matplotlib styles for scientific plotting

Python 8,450 781 Updated Nov 20, 2025

LTeX+: Grammar/spell checker 🔍✔️ for VS Code using LanguageTool with support for LaTeX 🎓, Markdown 📝, and others

TypeScript 188 8 Updated Dec 2, 2025

A Python module to repair invalid JSON from LLMs

Python 4,171 161 Updated Dec 17, 2025

A Chinese résumé template written in Typst: concise syntax, clean styling, ready to use out of the box, with an optional photo

Typst 677 67 Updated Mar 18, 2025

An introductory LLM tutorial for developers; the Chinese edition of Andrew Ng's large language model course series

Jupyter Notebook 22,656 2,745 Updated Jun 12, 2025

🚀🚀 "Large models": train a 26M-parameter GPT entirely from scratch in just 2 hours! 🌏

Python 35,739 4,217 Updated Dec 14, 2025

Official inference framework for 1-bit LLMs

Python 24,461 1,914 Updated Jun 3, 2025

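This project shares the technical principles behind large models and hands-on experience with them (LLM engineering and real-world deployment of LLM applications)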

HTML 22,337 2,612 Updated Dec 3, 2025

General multi-task deep RL Agent

Python 186 14 Updated Jun 6, 2024

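Easily access on-campus network resources with no tedious setup: just paste a link, and a regular URL is instantly converted to your school's Web VPN URL.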

TypeScript 191 32 Updated Dec 13, 2025

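🥢 Cook like Laoxiangji (老乡鸡) 🐔. The main part was completed in 2024; this is not an official Laoxiangji repository. The text comes from the "Laoxiangji Dish Traceability Report" (《老乡鸡菜品溯源报告》) and has been summarized, edited, and organized. CookLikeHOC.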

JavaScript 22,537 2,282 Updated Oct 17, 2025

Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" (ICML 2024).

Python 20 2 Updated Jun 16, 2024

Source code for the X Recommendation Algorithm

Scala 67,969 12,645 Updated Sep 8, 2025

Microsoft PowerToys is a collection of utilities that help you customize Windows and streamline everyday tasks

C# 126,555 7,541 Updated Dec 17, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,108 99 Updated Nov 23, 2025

LLM agents built for control. Designed for real-world use. Deployed in minutes.

Python 16,712 1,404 Updated Dec 17, 2025

RL from zero pretraining: can it be done? Yes.

Python 282 22 Updated Sep 28, 2025

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 10,917 1,214 Updated Dec 16, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,111 7,770 Updated Dec 16, 2025

🔥 MaxKB is an open-source platform for building enterprise-grade agents. A powerful, easy-to-use open-source enterprise-grade agent platform.

Python 19,570 2,551 Updated Dec 17, 2025