Skip to content
View goodgzm's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report goodgzm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.

Python 370 36 Updated Mar 16, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 55,491 9,454 Updated Nov 12, 2025

AI 味去除 - 仅在 Gemini 2.5 Pro 上测试通过

941 60 Updated Apr 2, 2025

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 333,407 64,989 Updated Mar 24, 2026
Python 12 Updated Mar 20, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 68,970 8,405 Updated Mar 21, 2026

A framework for few-shot evaluation of language models.

Python 11,825 3,122 Updated Mar 18, 2026

LLM Can Get "Brain Rot"

Python 161 10 Updated Jan 9, 2026

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Python 711 134 Updated May 18, 2024

Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation

Python 369 82 Updated Sep 2, 2025
Python 8 1 Updated Oct 10, 2024

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Python 1,731 298 Updated Sep 8, 2022

Python Multi-Agent Reinforcement Learning framework

Python 2,168 411 Updated Dec 8, 2022
Shell 1,934 537 Updated Jan 19, 2026

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,550 188 Updated Mar 22, 2026

PyTorch implementation of FQF, IQN and QR-DQN.

Python 189 32 Updated Jul 25, 2024

Installer Microsoft Office For MacOS

6,179 830 Updated Mar 23, 2026

Rainbow: Combining Improvements in Deep Reinforcement Learning

Python 1,663 293 Updated Jan 13, 2022

Mastering Diverse Domains through World Models

Python 2,970 490 Updated Sep 23, 2025

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 6,947 748 Updated Feb 4, 2026

chinese translation of llm-course

Jupyter Notebook 320 35 Updated Apr 3, 2024

我的AI学习笔记。包括b站up主deep_thoughts的PyTorch课程笔记和相关代码;北邮深度学习与数字视频PPT代码。

Jupyter Notebook 43 8 Updated Jun 18, 2024

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 42,897 5,157 Updated Mar 24, 2026

[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS

Python 1,242 113 Updated Jan 16, 2026

Recipes to train the self-rewarding reasoning LLMs.

Python 231 14 Updated Mar 2, 2025

An open source flight dynamics & control software library

C++ 1,954 553 Updated Mar 19, 2026

An environment based on JSBSIM aimed at one-to-one close air combat.

Python 459 139 Updated May 19, 2025

An educational resource to help anyone learn deep reinforcement learning.

Python 11,669 2,443 Updated Aug 5, 2024

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

Python 2,545 212 Updated Mar 13, 2025
Next