Skip to content
View Hiroki11x's full-sized avatar
🦙
🦙

Organizations

@jphacks @rioyokotalab @crest-deep @TITAMAS @RotaPlusPlus @Agents-NY @ArtHackDay-Plus1 @MLHPC

Block or report Hiroki11x

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A collection of optimization problems in mathematics

HTML 217 33 Updated Feb 18, 2026

A theory of optimal learning rate schedules in SGD from optimal control theory

Jupyter Notebook 1 Updated Feb 17, 2026

Scalable Computing for Advanced Library and Environment

Fortran 22 7 Updated Sep 18, 2025

Spectral Sphere Optimizer

Python 98 1 Updated Jan 14, 2026

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

68,684 6,831 Updated Jan 4, 2026

Pseudo-Asynchronous Local SGD: Robust and Efficient Data-Parallel Training (TMLR2025)

Python 1 Updated Aug 24, 2025

Implementatoin for paper: A Unified Stability Analysis of SAM vs SGD: Role of Data Coherence and Emergence of Simplicity Bias

Python 2 Updated Oct 18, 2025

CellViT: Vision Transformers for Precise Cell Segmentation and Classification

Python 359 63 Updated Jul 23, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,165 641 Updated Feb 18, 2026
Python 6 Updated Dec 2, 2025
Python 15 1 Updated Dec 11, 2025

Muon is Scalable for LLM Training

1,434 82 Updated Aug 3, 2025

Open-source framework for the research and development of foundation models.

HTML 761 83 Updated Feb 18, 2026

Official Implementation for NorMuon paper

Python 56 3 Updated Feb 9, 2026

Control LLM

Python 22 4 Updated Apr 6, 2025

The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models

Python 716 49 Updated Jan 30, 2026

fmchisel: Efficient Compression and Training Algorithms for Foundation Models

Python 83 10 Updated Oct 23, 2025

Dion optimizer algorithm

Python 438 49 Updated Jan 16, 2026
Python 131 20 Updated Sep 9, 2025

Benchmarking Optimizers for LLM Pretraining

Python 51 3 Updated Dec 30, 2025

[ICLR 2025] How Does Critical Batch Size Scale in Pre-training?

Jupyter Notebook 10 1 Updated Feb 20, 2025

S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/

Python 1,011 76 Updated Apr 26, 2024

KFAC from scratch (KFS)---Paper & Code

TeX 7 1 Updated Jan 13, 2026

Minimal reference implementations for per-example gradient norm methods for computing GNS

Jupyter Notebook 9 1 Updated Nov 15, 2024

Data Uniformity Improves Training Efficiency and More, with a Convergence Framework Beyond the NTK Regime

Python 6 Updated Nov 28, 2025
Next