GHGmc2

🎯

Focusing

Maozhou Ge GHGmc2

Scaling Computation

94 followers · 760 following

Shanghai

Lists (1)

Scale

13 repositories

Starred repositories

jet-ai-projects / Lightning-OPD

Python 57 5 Updated May 12, 2026

lightvector / KataGo

GTP engine and self-play learning in Go

C++ 4,712 721 Updated Jun 22, 2026

ericjang / autogo

Autoresearch for Go

Python 215 27 Updated May 15, 2026

RL-Align / RL-Kernel

Modern RL Post-training Infrastructure: Optimized for NVIDIA/AMD GPUs with a focus on vLLM and DeepSpeed integration, CUDA/ROCm/Triton kernels, and transparent hardware-aware scaling.

Python 150 32 Updated Jun 22, 2026

kenjihiranabe / The-Art-of-Linear-Algebra

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 21,593 2,564 Updated Jun 30, 2025

Accio-Lab / Dressage

Python 84 4 Updated Jun 20, 2026

nvidia-cosmos / cosmos-rl

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Python 450 64 Updated Jun 15, 2026

uw-syfi / piper

A programmable distributed training system for PyTorch

Python 17 2 Updated Jun 10, 2026

recursive-org / first-steps-toward-automated-ai-research

Research artifacts from Recursive's automated AI research system

Python 133 12 Updated Jun 11, 2026

openxla / xprof

A profiling and performance analysis tool for machine learning

C++ 543 88 Updated Jun 23, 2026

vllm-project / vime

An LLM post-training framework with vLLM for RL Scaling

Python 294 32 Updated Jun 23, 2026

verl-project / uni-agent

A unified framework for building, running, and training general agents at scale.

Python 359 54 Updated Jun 18, 2026

Tencent-Hunyuan / UniRL

UniRL is a Framework for Unified Multimodal Model Reinforcement Learning

Python 697 43 Updated Jun 23, 2026

tilde-research / comp-muon-release

Compositional Muon release

Python 22 4 Updated Jun 5, 2026

NVlabs / GDPO

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 480 32 Updated May 20, 2026

meta-pytorch / remat

torch_remat fine-grained activation checkpointing API

Python 13 Updated Jun 8, 2026

verl-project / rl-insight

Provide performance insight capabilities for RL frameworks.

Python 36 26 Updated Jun 23, 2026

thinking-machines-lab / batch_invariant_ops

Python 1,028 79 Updated Nov 4, 2025

Gen-Verse / OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

Python 5,516 598 Updated May 23, 2026

getpaseo / paseo

Orchestrate multiple coding agents from desktop and mobile

TypeScript 9,049 862 Updated Jun 23, 2026

harnets / multiverse

GPU-accelerated LLM Training Simulator

Makefile 22 9 Updated Jun 26, 2025

duoan / TorchCode

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 4,217 365 Updated May 25, 2026

open-lm-engine / coda-kernels

CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs

Python 216 22 Updated Jun 23, 2026

inclusionAI / asystem-awex

A high-performance RL training-inference weight synchronization framework, designed to enable second-level parameter updates from training to inference in RL workflows

Python 160 18 Updated May 25, 2026