
Organizations

@ucdavis @gunrock @conda-forge

Starred repositories

Helpful kernel tutorials and examples for tile-based GPU programming

Python 455 22 Updated Dec 19, 2025

An API-compatible, drop-in replacement for Apple's Foundation Models framework with support for custom language model providers.

Swift 604 35 Updated Dec 17, 2025

Modular RDMA Interface

C++ 67 15 Updated Dec 19, 2025

Fast and Furious AMD Kernels

C++ 324 40 Updated Dec 19, 2025

Nano vLLM

Python 9,793 1,232 Updated Nov 3, 2025

AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming

Python 140 27 Updated Dec 19, 2025

The best ChatGPT that $100 can buy.

Python 38,891 4,911 Updated Dec 9, 2025

Post-training with Tinker

Python 2,578 247 Updated Dec 19, 2025

Training API and CLI

Python 266 28 Updated Dec 15, 2025

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 1,689 218 Updated Dec 19, 2025

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

Zig 3,003 110 Updated Dec 19, 2025

(WIP) A small but powerful, homemade PyTorch from scratch.

C 662 33 Updated Dec 19, 2025

Dion optimizer algorithm

Python 405 41 Updated Dec 18, 2025

Hierarchical Reasoning Model Official Release

Python 12,158 1,779 Updated Sep 9, 2025

An extremely fast Python type checker and language server, written in Rust.

Python 15,032 157 Updated Dec 19, 2025

Artificial Neural Engine Machine Learning Library

Python 1,276 48 Updated Dec 5, 2025

An Extensible Compiler IR Framework

Rust 228 21 Updated Dec 18, 2025

Free, simple, fast interactive diagrams for any GitHub repository

TypeScript 15,008 1,137 Updated May 26, 2025

Supporting PyTorch models with the Google AI Edge TFLite runtime.

Jupyter Notebook 866 130 Updated Dec 18, 2025

LiteRT, the successor to TensorFlow Lite, is Google's on-device framework for high-performance ML and GenAI deployment on edge platforms, via efficient conversion, runtime, and optimization.

C++ 1,135 156 Updated Dec 19, 2025

QuickReduce is a performant all-reduce library designed for AMD ROCm that supports inline compression.

C++ 36 7 Updated Aug 29, 2025

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

C++ 953 385 Updated Dec 10, 2025

🚀🚀 Train a 26M-parameter GPT completely from scratch in just 2 hours!

Python 35,841 4,232 Updated Dec 14, 2025

ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.

Python 414 96 Updated Dec 19, 2025

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 8,988 702 Updated Aug 18, 2024

Python 1,454 121 Updated Feb 15, 2025

ModernBERT model optimized for Apple Neural Engine.

Python 29 2 Updated Jan 10, 2025

Moved to Codeberg

Zig 42,602 3,111 Updated Nov 27, 2025

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

C++ 539 49 Updated Sep 13, 2025