Kun Kun0913

🫠

10 followers · 26 following

Beihang University
Beijing, China

Highlights

Pro

Lists (5)

Daily tools

36 repositories

LLM

15 repositories

Paper with Code :-)

28 repositories

RL Env

24 repositories

RL Tools

Open source code and utility framework

38 repositories

Stars

libo-huang / Awesome-Causal-Reinforcement-Learning

[TNNLS-2024, arXiv-2023.2.10] Official repository of "A Survey on Causal Reinforcement Learning"

179 4 Updated Dec 8, 2025

LantaoYu / MARL-Papers

Paper list of multi-agent reinforcement learning (MARL)

4,638 764 Updated Nov 19, 2025

facebookresearch / BenchMARL

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows users to quickly compare different MARL algorithms, tasks, and models while being systematically ground…

Python 535 109 Updated Nov 10, 2025

PaperDebugger / paperdebugger

Paper Debugger is the best Overleaf companion

TypeScript 1,140 51 Updated Dec 17, 2025

DayuanJiang / next-ai-draw-io

A Next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…

TypeScript 12,681 1,268 Updated Dec 17, 2025

adaptive-intelligent-robotics / QDax

Accelerated Quality-Diversity

Python 332 56 Updated Oct 30, 2025

FLAIROx / JaxMARL

Multi-Agent Reinforcement Learning with JAX

Python 705 134 Updated Dec 9, 2025

deepseek-ai / DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Python 22,504 2,682 Updated Nov 11, 2025

henbudidiao / UAV-path-planning

Multi/single UAV (unmanned aerial vehicle) path planning based on deep reinforcement learning

Python 604 44 Updated Aug 27, 2025

iamlilAJ / Pre-Strategy-Intervention

A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning

Python 6 Updated Nov 24, 2025

garrettj403 / SciencePlots

Matplotlib styles for scientific plotting

Python 8,450 781 Updated Nov 20, 2025

ltex-plus / vscode-ltex-plus

LTeX+: Grammar/spell checker 🔍✔️ for VS Code using LanguageTool with support for LaTeX 🎓, Markdown 📝, and others

TypeScript 188 8 Updated Dec 2, 2025

mangiucugna / json_repair

A Python module to repair invalid JSON from LLMs

Python 4,171 161 Updated Dec 17, 2025

OrangeX4 / Chinese-Resume-in-Typst

A Chinese résumé written in Typst: concise syntax, clean styling, ready to use out of the box, with an optional photo

Typst 677 67 Updated Mar 18, 2025

datawhalechina / llm-cookbook

An introductory LLM tutorial for developers; the Chinese edition of Andrew Ng's large language model course series

Jupyter Notebook 22,656 2,745 Updated Jun 12, 2025

jingyaogong / minimind

🚀🚀 Train a small 26M-parameter GPT entirely from scratch in just 2 hours! 🌏

Python 35,739 4,217 Updated Dec 14, 2025

microsoft / BitNet

Official inference framework for 1-bit LLMs

Python 24,461 1,914 Updated Jun 3, 2025

liguodongiot / llm-action

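This project shares the technical principles behind large models and hands-on experience with them (LLM engineering and deploying LLM applications)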

HTML 22,337 2,612 Updated Dec 3, 2025

huggingface / jat

General multi-task deep RL Agent

Python 186 14 Updated Jun 6, 2024

lcandy2 / webvpn-converter

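Easily access on-campus network resources with no tedious setup: paste a link, and a regular URL is instantly converted to your school's Web VPN URL.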

TypeScript 191 32 Updated Dec 13, 2025

Gar-b-age / CookLikeHOC

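🥢 Cook like Laoxiangji 🐔. The main part was completed in 2024; this is not an official Laoxiangji repository. The text comes from the "Laoxiangji Dish Traceability Report" and has been summarized, edited, and organized. CookLikeHOC.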

JavaScript 22,537 2,282 Updated Oct 17, 2025

adaptive-intelligent-robotics / QDAC

Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" (ICML 2024).

Python 20 2 Updated Jun 16, 2024

twitter / the-algorithm

Source code for the X Recommendation Algorithm

Scala 67,969 12,645 Updated Sep 8, 2025

microsoft / PowerToys

Microsoft PowerToys is a collection of utilities that help you customize Windows and streamline everyday tasks

C# 126,555 7,541 Updated Dec 17, 2025

KellerJordan / Muon

Muon is an optimizer for hidden layers in neural networks

Python 2,108 99 Updated Nov 23, 2025

emcie-co / parlant

LLM agents built for control. Designed for real-world use. Deployed in minutes.

Python 16,712 1,404 Updated Dec 17, 2025

tokenbender / avataRL

rl from zero pretrain, can it be done? yes.

Python 282 22 Updated Sep 28, 2025

Farama-Foundation / Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 10,917 1,214 Updated Dec 16, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,111 7,770 Updated Dec 16, 2025

1Panel-dev / MaxKB

🔥 MaxKB is an open-source platform for building enterprise-grade agents: powerful, easy to use, and open source.

Python 19,570 2,551 Updated Dec 17, 2025