WWIV BBS Software v5

C++ 198 82 Updated Feb 2, 2026

vLLM plugin for RBLN NPU

Python 41 8 Updated Feb 10, 2026

Next-Gen GUI-based WiFi and Bluetooth Analyzer for Linux

Python 1,481 176 Updated Dec 8, 2025

The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.

LLVM 44 7 Updated Oct 25, 2021

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,672 1,402 Updated Jan 28, 2026

The repository targets the OpenCL gemm function performance optimization. It compares several libraries clBLAS, clBLAST, MIOpenGemm, Intel MKL(CPU) and cuBLAS(CUDA) on different matrix sizes/vendor…

C 17 6 Updated Mar 28, 2019

A simple guide to compile Llama.cpp and llama-cpp-python using CLBLAST for older generation AMD GPUs.

C 6 1 Updated Sep 3, 2023
Fortran 103 39 Updated Feb 9, 2026

A software library containing BLAS functions written in OpenCL

C++ 865 241 Updated Aug 2, 2024

Intel® GPU Compute Samples

C++ 109 16 Updated Sep 10, 2025

Vulkan mipmap generation with 3 strategies: blit chain, compute with per-level barriers, compute with subgroup shuffle.

C++ 4 Updated May 23, 2024

Customizable compute shader for fast cache-aware mipmap generation

GLSL 56 2 Updated Sep 7, 2024

An Open Framework for Federated Learning.

Python 829 235 Updated Jan 15, 2026

Pretends to export C++ functions but with a C ABI

Python 1 Updated Sep 15, 2024

MLX: An array framework for Apple silicon

C++ 23,870 1,502 Updated Feb 10, 2026

Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices

LLVM 872 88 Updated Apr 23, 2025

Examples for building and running LLM services and applications locally with Podman

Python 2 Updated Aug 19, 2024

chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.

LLVM 315 40 Updated Feb 10, 2026

Community maintained container images to use with toolbx and distrobox

Dockerfile 415 39 Updated Dec 17, 2025

On-device AI across mobile, embedded and edge for PyTorch

Python 4,254 831 Updated Feb 10, 2026

glmark2 is an OpenGL 2.0 and ES 2.0 benchmark

C 507 203 Updated Sep 29, 2025

CUDA on non-NVIDIA GPUs

Rust 13,920 895 Updated Feb 9, 2026

Mali G610 & 710 GPU Driver for Termux

C 76 28 Updated Sep 22, 2024

Raspberry Pi 4 UEFI Firmware Images

1,339 164 Updated Nov 21, 2025

Reverse engineered Linux driver for the Apple Neural Engine (ANE).

C 454 23 Updated Mar 12, 2024

MIOpenGEMM is now deprecated

C++ 61 12 Updated Jul 17, 2023

Open source version of RV, the Sci-Tech award-winning media review and playback software.

C++ 701 203 Updated Feb 9, 2026

Implementation of OpenCL 3.0 on Vulkan

C++ 424 48 Updated Feb 9, 2026

An Xlib compatibility layer implemented on top of the Haiku API, in order to run X11 applications on Haiku without an X server.

C 97 3 Updated Aug 29, 2024