-
13:01
(UTC +08:00)
Pinned Loading
-
MeanFlowQL
MeanFlowQL PublicOfficial code for AAAI 2026 paper (One-Step Generative Policies with Q-Learning: A Reformulation of MeanFlow)
-
Book-Mathematical-Foundation-of-Reinforcement-Learning
Book-Mathematical-Foundation-of-Reinforcement-Learning PublicForked from MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
MATLAB 1
-
torchjd
torchjd PublicForked from TorchJD/torchjd
Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning).
Python
-
SwanHubX/SwanLab
SwanHubX/SwanLab Public⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…
If the problem persists, check the GitHub status page or contact support.