Starred repositories
Explore training for quantized models
A lightweight library for portable low-level GPU computation using WebGPU.
A high-throughput and memory-efficient inference and serving engine for LLMs
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Several optimization methods for half-precision general matrix multiplication (HGEMM) using Tensor Cores via the WMMA API and MMA PTX instructions.
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
PyTorch code and models for V-JEPA self-supervised learning from video.
General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for…
A micro Vulkan compute pipeline and a collection of benchmarking compute shaders
Machine Learning Engineering Open Book
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
A repository for log-time feedforward networks
Zeta implementation of a reusable, plug-and-play feedforward from the paper "Exponentially Faster Language Modeling"
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) l…
A holistic way of understanding how WebRTC and its protocols run in practice, with code and detailed documentation.
An Ethereum-compatible smart contract parachain on Polkadot
Substrate: The platform for blockchain innovators
Ethereum JSON-RPC multi-transport client. Rust implementation of web3 library. ENS address: rust-web3.eth
NuCypher's reference implementation of Umbral (threshold proxy re-encryption) using OpenSSL and Cryptography.io
The reference implementation of the ERC-721 non-fungible token standard.