jgh- 🌳
  • aws ivs / twitch
  • San Francisco, CA

Organizations

@unpause-live

Starred repositories

Glamourous agentic coding for all 💘

Go 22,701 1,474 Updated Apr 9, 2026

Frame profiler

C++ 15,600 1,041 Updated Apr 3, 2026

Explore training for quantized models

Python 26 2 Updated Jul 12, 2025

A lightweight library for portable low-level GPU computation using WebGPU.

C++ 3,961 192 Updated Oct 8, 2025

An ML Systems Onboarding list

1,030 39 Updated Feb 19, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 75,760 15,342 Updated Apr 9, 2026

Fast Multi-dimensional Sparse Attention

C++ 733 58 Updated Apr 7, 2026

An implementation of bucketMul LLM inference

Swift 227 16 Updated Jul 1, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,023 4,776 Updated Apr 8, 2026

Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.

Cuda 535 89 Updated Sep 8, 2024

Apple GPU microarchitecture

Metal 593 29 Updated Sep 22, 2024
WGSL 72 4 Updated Mar 15, 2024
C++ 14 Updated May 23, 2023

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,208 397 Updated Jul 11, 2024

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,697 375 Updated Feb 27, 2025

General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for…

C++ 2,484 187 Updated Apr 7, 2026

A micro Vulkan compute pipeline and a collection of benchmarking compute shaders

C++ 262 44 Updated Mar 27, 2025

Machine Learning Engineering Open Book

Python 17,643 1,119 Updated Mar 16, 2026

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,939 223 Updated Mar 8, 2024

MLX: An array framework for Apple silicon

C++ 25,217 1,665 Updated Apr 8, 2026

A repository for log-time feedforward networks

Python 224 22 Updated Apr 9, 2024

Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"

Python 16 Updated Nov 11, 2024

FidelityFX Super Resolution 2

C 2,070 208 Updated Aug 26, 2023

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) l…

HTML 583 162 Updated Jul 1, 2024

A holistic way of understanding how WebRTC and its protocols run in practice, with code and detailed documentation.

Go 937 45 Updated Sep 19, 2024

An Ethereum-compatible smart contract parachain on Polkadot

TypeScript 939 382 Updated Apr 8, 2026

Substrate: The platform for blockchain innovators

Rust 8,428 2,650 Updated Sep 25, 2023

Ethereum JSON-RPC multi-transport client. Rust implementation of web3 library. ENS address: rust-web3.eth

Rust 1,507 474 Updated Oct 24, 2025

NuCypher's reference implementation of Umbral (threshold proxy re-encryption) using OpenSSL and Cryptography.io

Python 298 69 Updated Dec 9, 2022

The reference implementation of the ERC-721 non-fungible token standard.

Solidity 1,047 396 Updated Aug 10, 2022