Skip to content
View nwnk's full-sized avatar

Block or report nwnk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

WWIV BBS Software v5

C++ 201 84 Updated Feb 14, 2026

vLLM plugin for RBLN NPU

Python 44 8 Updated Mar 23, 2026

Next-Gen GUI-based WiFi and Bluetooth Analyzer for Linux

Python 1,518 179 Updated Mar 20, 2026

The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.

LLVM 45 7 Updated Oct 25, 2021

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,736 1,409 Updated Jan 28, 2026

The repository targets the OpenCL gemm function performance optimization. It compares several libraries clBLAS, clBLAST, MIOpenGemm, Intel MKL(CPU) and cuBLAS(CUDA) on different matrix sizes/vendor…

C 17 6 Updated Mar 28, 2019

A simple guide to compile Llama.cpp and llama-cpp-python using CLBLAST for older generation AMD GPUs.

C 6 2 Updated Sep 3, 2023
Fortran 104 40 Updated Mar 22, 2026

a software library containing BLAS functions written in OpenCL

C++ 865 241 Updated Aug 2, 2024

Intel® GPU Compute Samples

C++ 108 16 Updated Sep 10, 2025

Vulkan mipmap generation with 3 strategies: blit chain, compute with per-level barriers, compute with Subgroup shuffle.

C++ 4 Updated May 23, 2024

Customizable compute shader for fast cache-aware mipmap generation

GLSL 56 2 Updated Sep 7, 2024

An Open Framework for Federated Learning.

Python 833 235 Updated Feb 21, 2026

pretends to export c++ functions but with a c abi

Python 1 Updated Sep 15, 2024

MLX: An array framework for Apple silicon

C++ 24,708 1,601 Updated Mar 23, 2026

Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices

LLVM 875 88 Updated Apr 23, 2025

Examples for building and running LLM services and applications locally with Podman

Python 2 Updated Aug 19, 2024

chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.

LLVM 321 40 Updated Mar 23, 2026

Community maintained container images to use with toolbx and distrobox

Dockerfile 416 39 Updated Dec 17, 2025

On-device AI across mobile, embedded and edge for PyTorch

Python 4,415 891 Updated Mar 23, 2026

glmark2 is an OpenGL 2.0 and ES 2.0 benchmark

C 508 204 Updated Sep 29, 2025

CUDA on non-NVIDIA GPUs

Rust 14,036 900 Updated Mar 23, 2026

Mali G610 & 710 GPU Driver for Termux

C 76 29 Updated Sep 22, 2024

Raspberry Pi 4 UEFI Firmware Images

1,352 166 Updated Mar 22, 2026

Reverse engineered Linux driver for the Apple Neural Engine (ANE).

C 478 26 Updated Mar 12, 2024

MIOpenGEMM is now deprecated

C++ 61 12 Updated Jul 17, 2023

Open source version of RV, the Sci-Tech award-winning media review and playback software.

C++ 708 206 Updated Mar 17, 2026

Implementation of OpenCL 3.0 on Vulkan

C++ 426 49 Updated Mar 23, 2026

An Xlib compatibility layer implemented on top of the Haiku API, in order to run X11 applications on Haiku without an X server.

C 97 3 Updated Aug 29, 2024
Next