- Thoughtworks
- Singapore
- https://cemse.kaust.edu.sa/people/person/fangyuan-yu
mod_gpt Public
Modified GPT model pre-training for GPU poor
SkillZero Public
Forked from ZJU-REAL/SkillZero
Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"
Python Apache License 2.0 Updated Apr 3, 2026
skydiscover Public
Forked from skydiscover-ai/skydiscover
AI-Driven Scientific and Algorithmic Discovery
Python Apache License 2.0 Updated Mar 3, 2026
DreamDojo Public
Forked from NVIDIA/DreamDojo
Source code of DreamDojo by the NVIDIA GEAR Team.
Python Apache License 2.0 Updated Feb 20, 2026
unitree_rl_mjlab Public
Forked from unitreerobotics/unitree_rl_mjlab
This is a repository for reinforcement learning implementation for Unitree robots, based on Mujoco.
Python Apache License 2.0 Updated Jan 30, 2026
memory-maze Public
Forked from jurgisp/memory-maze
Evaluating long-term memory of reinforcement learning algorithms
Jupyter Notebook MIT License Updated Jan 29, 2026
OpenEnv Public
Forked from meta-pytorch/OpenEnv
An interface library for RL post training with environments.
Python BSD 3-Clause "New" or "Revised" License Updated Jan 13, 2026
mHC-manifold-constrained-hyper-connections Public
Forked from tokenbender/mHC-manifold-constrained-hyper-connections
implementations and experimentation on mHC by deepseek - https://arxiv.org/abs/2512.24880
Python Apache License 2.0 Updated Jan 4, 2026
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
Python MIT License Updated Nov 18, 2025
Metaworld Public
Forked from Farama-Foundation/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
Python MIT License Updated Nov 10, 2025
vlm-gym Public
Forked from sdan/vlm-gym
RL gym for vision language models in JAX
Python Apache License 2.0 Updated Oct 30, 2025
TextArena Public
Forked from TextArena/TextArena
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
Python MIT License Updated Oct 29, 2025
minimind Public
Forked from jingyaogong/minimind
🚀🚀 "Large model": train a small 26M-parameter GPT entirely from scratch in just 2 hours! 🌏
Jupyter Notebook Apache License 2.0 Updated Oct 8, 2025
es-fine-tuning-paper Public
Forked from VsonicV/es-fine-tuning-paper
This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"
Python Other Updated Oct 7, 2025
TinyRecursiveModels Public
Forked from SamsungSAILMontreal/TinyRecursiveModels
A fork for TRM
Python MIT License Updated Oct 7, 2025
MiniLive Public
Forked from NVlabs/LongLive
LongLive: Real-time Interactive Long Video Generation
Python Other Updated Oct 2, 2025
bdh Public
Forked from pathwaycom/bdh
Baby Dragon Hatchling (BDH) – Architecture and Code
Python MIT License Updated Oct 1, 2025
MobileLLM-R1 Public
Forked from facebookresearch/MobileLLM-R1
MobileLLM-R1
Python Other Updated Sep 30, 2025
tinyworlds Public
Forked from AlmondGod/tinyworlds
A minimal implementation of DeepMind's Genie world model
Python Updated Sep 28, 2025
RL-Factory Public
Forked from Simple-Efficient/RL-Factory
Train your Agent model via our easy and efficient framework
Python Apache License 2.0 Updated Sep 11, 2025
HRM Public
Forked from sapientinc/HRM
Hierarchical Reasoning Model Official Release
Python Apache License 2.0 Updated Sep 9, 2025
Diffusion-Explorer Public
Forked from helblazer811/Diffusion-Explorer
Interactive visualizations of the theory behind diffusion models.
Svelte Updated May 17, 2025
ColossalAI Public
Forked from hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Python Apache License 2.0 Updated May 14, 2025
MoSA Public
Forked from piotrpiekos/MoSA
User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice routing providing a content-based sparse attention mechanism.
Python MIT License Updated May 3, 2025