jgh-
🌳
  • aws ivs / twitch
  • San Francisco, CA

Organizations

@unpause-live

Starred repositories

Glamourous agentic coding for all 💘

Go 22,138 1,419 Updated Mar 28, 2026

Frame profiler

C++ 15,534 1,036 Updated Mar 28, 2026

Explore training for quantized models

Python 26 2 Updated Jul 12, 2025

A lightweight library for portable low-level GPU computation using WebGPU.

C++ 3,957 190 Updated Oct 8, 2025

An ML Systems Onboarding list

1,024 38 Updated Feb 19, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,612 14,911 Updated Mar 29, 2026

Fast Multi-dimensional Sparse Attention

C++ 730 58 Updated Mar 25, 2026

An implementation of bucketMul LLM inference

Swift 227 16 Updated Jul 1, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,935 4,768 Updated Mar 29, 2026

Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.

Cuda 534 88 Updated Sep 8, 2024

Apple GPU microarchitecture

Metal 588 29 Updated Sep 22, 2024

WGSL 72 4 Updated Mar 15, 2024

C++ 14 Updated May 23, 2023

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,206 396 Updated Jul 11, 2024

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,649 371 Updated Feb 27, 2025

General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for…

C++ 2,479 185 Updated Mar 18, 2026

A micro Vulkan compute pipeline and a collection of benchmarking compute shaders

C++ 261 44 Updated Mar 27, 2025

Machine Learning Engineering Open Book

Python 17,567 1,115 Updated Mar 16, 2026

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,938 223 Updated Mar 8, 2024

MLX: An array framework for Apple silicon

C++ 24,857 1,611 Updated Mar 26, 2026

A repository for log-time feedforward networks

Python 224 22 Updated Apr 9, 2024

Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"

Python 16 Updated Nov 11, 2024

FidelityFX Super Resolution 2

C 2,071 207 Updated Aug 26, 2023

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) l…

HTML 583 160 Updated Jul 1, 2024

A holistic way of understanding how WebRTC and its protocols run in practice, with code and detailed documentation.

Go 937 45 Updated Sep 19, 2024

An Ethereum-compatible smart contract parachain on Polkadot

TypeScript 939 385 Updated Mar 27, 2026

Substrate: The platform for blockchain innovators

Rust 8,429 2,653 Updated Sep 25, 2023

Ethereum JSON-RPC multi-transport client. Rust implementation of web3 library. ENS address: rust-web3.eth

Rust 1,513 476 Updated Oct 24, 2025

NuCypher's reference implementation of Umbral (threshold proxy re-encryption) using OpenSSL and Cryptography.io

Python 300 70 Updated Dec 9, 2022

The reference implementation of the ERC-721 non-fungible token standard.

Solidity 1,047 396 Updated Aug 10, 2022