- Beihang University
- Beijing, China
Stars
[TNNLS-2024, arXiv-2023.2.10] Official repository of "A Survey on Causal Reinforcement Learning"
Paper list of multi-agent reinforcement learning (MARL)
BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL makes it possible to quickly compare different MARL algorithms, tasks, and models while being systematically ground…
Paper Debugger is the best Overleaf companion
A Next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…
Accelerated Quality-Diversity
DeepSeek Coder: Let the Code Write Itself
Multi/single UAV (unmanned aerial vehicle) path planning based on deep reinforcement learning
A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning
Matplotlib styles for scientific plotting
LTeX+: Grammar/spell checker 🔍✔️ for VS Code using LanguageTool with support for LaTeX 🎓, Markdown 📝, and others
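A Python module to repair invalid JSON from LLMs
A Chinese résumé template written in Typst: concise syntax, clean styling, ready to use out of the box, with an optional photo
An introductory LLM tutorial for developers: the Chinese edition of Andrew Ng's large-model course series
🚀🚀 Large models: train a small 26M-parameter GPT entirely from scratch in just 2 hours! 🌏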
Official inference framework for 1-bit LLMs
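This project shares the technical principles behind large models along with hands-on experience (LLM engineering and deploying LLM applications in practice)
Easily access on-campus network resources without tedious setup: paste a link, and an ordinary URL is instantly converted into your school's Web VPN URL.
🥢 Cook like Laoxiangji (Home Original Chicken) 🐔. The main part was completed in 2024; this is not an official Laoxiangji repository. The text comes from the "Laoxiangji Dish Traceability Report" and has been summarized, edited, and organized. CookLikeHOC.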
Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" (ICML 2024).
Source code for the X Recommendation Algorithm
Microsoft PowerToys is a collection of utilities that help you customize Windows and streamline everyday tasks
Muon is an optimizer for hidden layers in neural networks
LLM agents built for control. Designed for real-world use. Deployed in minutes.
RL from zero pretraining: can it be done? Yes.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🔥 MaxKB is a powerful, easy-to-use open-source platform for building enterprise-grade agents.