
Get your documents ready for gen AI

Python 47,469 3,333 Updated Dec 19, 2025

High-performance FlashAttention-2 for AMD, Intel, and Apple GPUs. Drop-in replacement for PyTorch SDPA. Triton backend for ROCm (MI300X, RDNA3), Vulkan backend for consumer GPUs. No CUDA required.

Zig 122 5 Updated Dec 20, 2025

CAD files for DDR5 RDIMM cooling

8 1 Updated Jun 12, 2025

Verify the precision of all Kimi K2 API vendors

Python 488 26 Updated Nov 19, 2025

Set up root-on-zfs using whole disk, with dracut and zfsbootmenu

Shell 43 7 Updated Oct 17, 2025

MIG Partition Editor for NVIDIA GPUs

Go 234 53 Updated Dec 21, 2025

tiny, portable SOCKS5 server with very moderate resource usage

C 1,915 321 Updated Feb 12, 2025

With this program you can bind applications to a specific network interface / network adapter. This is very useful if you have multiple (internet) connections and want your program to use a specifi…

C 133 17 Updated Oct 26, 2025

DDR5 SPD EEPROM recovery tools.

Python 20 4 Updated Oct 28, 2024

Example playbooks to set up your OpenWrt router with Ansible

9 4 Updated May 21, 2017

OpenWrt configuration for router + dumb access points with Ansible playbook for centralised management

Shell 79 10 Updated Nov 4, 2025

MCP server that interacts with Obsidian via the Obsidian rest API community plugin

Python 2,566 324 Updated Jun 28, 2025

Small, pragmatic helpers for building command-line apps with System.CommandLine and the Microsoft.Extensions hosting/DI stack.

C# 7 Updated Nov 23, 2025

llama.cpp fork with additional SOTA quants and improved performance

C++ 1,395 166 Updated Dec 22, 2025

Simple hierarchic configuration manager for apps

Python 2 Updated Sep 3, 2024

Muffin is a fast, simple, and asynchronous web framework for Python 3

Python 686 24 Updated Nov 6, 2025

the little async embedded visualization framework that could (Python)

Python 55 3 Updated May 11, 2022

AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim and Neovim.

Python 1,098 106 Updated Nov 3, 2025

Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.

C 21 2 Updated Sep 1, 2025

Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies.

Cuda 22 Updated Nov 26, 2025

Puzzles for learning Triton

Jupyter Notebook 2,191 179 Updated Nov 18, 2024

Virtual File System C++

C++ 390 50 Updated Dec 22, 2025

Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm

C++ 212 31 Updated Dec 4, 2025

Light continuous delivery for Docker Compose

Rust 32 1 Updated Sep 14, 2025

Palm is a tree, not a language model

Python 9 2 Updated Aug 30, 2025

UTF-8 with C++ in a Portable Way

C++ 1,874 229 Updated Nov 10, 2025

unpacker for Microsoft EXEPACK

C 27 7 Updated Apr 14, 2019

RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks

Jupyter Notebook 224 20 Updated Jun 20, 2025

Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase

Python 13,422 1,010 Updated Dec 19, 2025

NVIDIA Linux open GPU with P2P support

C 98 17 Updated Dec 4, 2025