Skip to content
View harborn's full-sized avatar

Block or report harborn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

super repo for rocm systems projects

C++ 184 93 Updated Dec 23, 2025

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

C++ 500 258 Updated Dec 23, 2025

Analyze computation-communication overlap in V3/R1.

1,128 144 Updated Mar 21, 2025

Modular RDMA Interface

C++ 67 15 Updated Dec 23, 2025

AI Tensor Engine for ROCm

Python 325 164 Updated Dec 23, 2025

Optimized primitives for collective multi-GPU communication

C++ 4,327 1,096 Updated Dec 2, 2025

Public repo for HF blog posts

Jupyter Notebook 3,272 958 Updated Dec 22, 2025

When it comes to optimizers, it's always better to be safe than sorry

Python 398 14 Updated Sep 26, 2025

High-Performance C++ Fundamental Library

C++ 625 93 Updated Dec 22, 2025

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,506 696 Updated Dec 23, 2025

Fast and memory-efficient exact attention

Python 21,261 2,244 Updated Dec 23, 2025

Transformers 库快速入门教程

Python 1,788 211 Updated Sep 20, 2024

LLM101n: Let's build a Storyteller

35,935 1,961 Updated Aug 1, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,354 1,093 Updated Dec 10, 2025

LLM training in simple, raw C/CUDA

Cuda 28,455 3,336 Updated Jun 26, 2025

Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.

Shell 713 332 Updated Dec 22, 2025

Proxy: Next Generation Polymorphism in C++

C++ 3,041 214 Updated Dec 6, 2025

Open source code for AlphaFold 2.

Python 14,123 2,525 Updated Oct 31, 2025

Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2

Python 3,232 647 Updated Dec 16, 2025

The Serenity Operating System 🐞

C++ 32,721 3,290 Updated Dec 23, 2025

PyTorch Tutorial for Deep Learning Researchers

Python 32,052 8,275 Updated Aug 15, 2023

Pretrain, finetune and serve LLMs on Intel platforms with Ray

Python 131 35 Updated Sep 23, 2025

LLM inference in C/C++

C++ 91,900 14,209 Updated Dec 23, 2025

A data oriented, simple but powerful DSL language.

Go 48 8 Updated Nov 21, 2025

Port of OpenAI's Whisper model in C/C++

C++ 45,260 5,034 Updated Dec 18, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 96,109 26,343 Updated Dec 23, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,991 6,184 Updated Sep 18, 2024

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,230 357 Updated Dec 22, 2025

Hash function quality and speed tests

C++ 2,107 189 Updated Dec 2, 2025

A cross platform way to express data transformation, relational algebra, standardized record expression and plans.

Python 1,442 189 Updated Dec 21, 2025
Next