Skip to content
View goodgzm's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report goodgzm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 10 Updated Feb 6, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 67,187 8,174 Updated Feb 10, 2026

A framework for few-shot evaluation of language models.

Python 11,399 3,031 Updated Feb 11, 2026

LLM Can Get "Brain Rot"

Python 158 10 Updated Jan 9, 2026

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Python 710 134 Updated May 18, 2024

Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation

Python 365 82 Updated Sep 2, 2025
Python 7 1 Updated Oct 10, 2024

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Python 1,722 296 Updated Sep 8, 2022

Python Multi-Agent Reinforcement Learning framework

Python 2,157 409 Updated Dec 8, 2022
Shell 1,881 523 Updated Jan 19, 2026

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,530 185 Updated Feb 11, 2026

PyTorch implementation of FQF, IQN and QR-DQN.

Python 188 32 Updated Jul 25, 2024

Installer Microsoft Office For MacOS

5,934 780 Updated Feb 5, 2026

Rainbow: Combining Improvements in Deep Reinforcement Learning

Python 1,660 293 Updated Jan 13, 2022

Mastering Diverse Domains through World Models

Python 2,789 461 Updated Sep 23, 2025

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 6,375 697 Updated Feb 4, 2026

chinese translation of llm-course

Jupyter Notebook 309 35 Updated Apr 3, 2024

我的AI学习笔记。包括b站up主deep_thoughts的PyTorch课程笔记和相关代码;北邮深度学习与数字视频PPT代码。

Jupyter Notebook 43 8 Updated Jun 18, 2024

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 39,360 4,750 Updated Feb 6, 2026

[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS

Python 1,240 113 Updated Jan 16, 2026

Recipes to train the self-rewarding reasoning LLMs.

Python 231 12 Updated Mar 2, 2025

An open source flight dynamics & control software library

C++ 1,891 540 Updated Feb 10, 2026

An environment based on JSBSIM aimed at one-to-one close air combat.

Python 450 134 Updated May 19, 2025

An educational resource to help anyone learn deep reinforcement learning.

Python 11,587 2,433 Updated Aug 5, 2024

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

Python 2,512 210 Updated Mar 13, 2025

API to run VirtualHome, a Multi-Agent Household Simulator

Python 598 86 Updated Jun 10, 2025

A library for advanced large language model reasoning

Python 2,329 205 Updated Jun 10, 2025

StarCraft II Learning Environment

Python 8,256 1,170 Updated Jul 23, 2024

SMAC: The StarCraft Multi-Agent Challenge

Python 1,325 236 Updated Feb 18, 2024
Next