Skip to content
View hjy1's full-sized avatar

Highlights

  • Pro

Block or report hjy1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AI agents running research on single-GPU nanochat training automatically

Python 78,030 11,377 Updated Mar 26, 2026

Fast Hadamard transform in CUDA, with a PyTorch interface

C 310 60 Updated Mar 10, 2026

Free, open-source SSH terminal, SFTP file manager & S3 browser for macOS, Windows & Linux. Alternative to Termius.

TypeScript 48 5 Updated Apr 23, 2026

LLM inference in C/C++

C++ 107,548 17,598 Updated Apr 30, 2026

My notes on using Linux

Shell 1,015 105 Updated Apr 2, 2026

Powerful system-level package manager for Linux, macOS and Windows written in Rust – building on top of the Conda ecosystem.

Rust 6,941 498 Updated Apr 30, 2026

A conda-forge distribution.

Shell 9,711 496 Updated Apr 21, 2026
Python 18 3 Updated Apr 2, 2026

MathCode: A Frontier Mathematical Coding Agent

Python 483 48 Updated Apr 12, 2026

PLDI'24 Artifact for "The T-Complexity Costs of Error Correction for Control Flow in Quantum Computation".

OCaml 4 1 Updated Sep 29, 2025

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 9,128 2,321 Updated Mar 30, 2026

Fast Polar Decomposition for Muon

Python 142 13 Updated Apr 15, 2026
OCaml 16 1 Updated Apr 29, 2026

Development repository for the Triton language and compiler

MLIR 19,083 2,808 Updated Apr 30, 2026

A free and strong UCI xiangqi engine

C++ 1,733 305 Updated Apr 29, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,655 1,830 Updated Apr 25, 2026

Sparse Johnson-Lindenstrauss Transforms CUDA Kernel

Jupyter Notebook 1 Updated Nov 20, 2025

`dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.

Python 122 30 Updated Mar 24, 2026

Official inference framework for 1-bit LLMs

Python 38,740 3,510 Updated Mar 10, 2026

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 16,690 2,362 Updated Sep 3, 2025

AI book for everyone

Jupyter Notebook 34 7 Updated Apr 8, 2026

A fast, effective data attribution method for neural networks in PyTorch

Python 237 37 Updated Nov 18, 2024

The best ChatGPT that $100 can buy.

Python 52,723 7,054 Updated Apr 14, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,516 2,333 Updated Apr 30, 2026

Build resilient language agents as graphs.

Python 30,883 5,275 Updated Apr 30, 2026

Open deep learning compiler stack for Kendryte AI accelerators ✨

C# 877 206 Updated Mar 26, 2026

A formalization of geometry in Coq based on Tarski's axiom system

Rocq Prover 207 29 Updated Nov 17, 2025

Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.

Python 772 118 Updated Apr 22, 2026
Python 2,915 623 Updated Apr 29, 2026

My learning notes for ML SYS.

Python 6,159 401 Updated Apr 23, 2026
Next