Skip to content
View skyne98's full-sized avatar
💭
Buildin'
💭
Buildin'

Block or report skyne98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)

Python 1,296 79 Updated Dec 18, 2024

GPUGrants - a list of GPU grants that I can think of

53 5 Updated Sep 13, 2025

Official inference framework for 1-bit LLMs

Python 24,460 1,913 Updated Jun 3, 2025

Rust library for generating vector embeddings, reranking. Re-write of qdrant/fastembed.

Rust 693 97 Updated Dec 17, 2025

Local first semantic and hybrid BM25 grep / search tool for use by AI and humans!

Rust 1,077 38 Updated Nov 16, 2025

Hindsight: Agent Memory That Works Like Human Memory

Python 630 63 Updated Dec 18, 2025

Atomic secret provisioning for NixOS based on sops

Nix 2,455 200 Updated Dec 15, 2025

🌈 React for interactive command-line apps

TypeScript 33,360 791 Updated Nov 19, 2025

CPM.cu is a lightweight, high-performance CUDA implementation for LLMs, optimized for end-device inference and featuring cutting-edge techniques in sparse architecture, speculative sampling and qua…

Cuda 212 21 Updated Oct 10, 2025

Nix-native configuration for niri

Nix 611 87 Updated Dec 18, 2025
Python 53 1 Updated Mar 30, 2025

Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK

C++ 89 5 Updated Dec 2, 2025

Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc

Go 2,062 138 Updated Dec 14, 2025

A tool for parsing, dumping and modifying data in Radeon PowerPlay tables

Python 177 29 Updated Mar 15, 2025

Fast and Furious AMD Kernels

C++ 321 40 Updated Dec 16, 2025

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

C++ 499 258 Updated Dec 18, 2025

Teldrive

Go 2,524 373 Updated Nov 6, 2025

Development repository for the Triton language and compiler

MLIR 17,875 2,455 Updated Dec 18, 2025

🐰 Bencher - Continuous Benchmarking

MDX 777 35 Updated Dec 17, 2025

Rustic bindings to the IREE Compiler/Runtime

Rust 24 5 Updated Aug 18, 2025

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,516 810 Updated Dec 18, 2025

A port of the RWKV v7 language model, implemented with the Burn deep learning framework

Rust 10 2 Updated Jun 9, 2025

Models and examples built with Burn

Rust 319 49 Updated Nov 6, 2025

RimSort is an open source mod manager for the video game RimWorld. There is support for Linux, Mac, and Windows, built from the ground up to be a reliable, community-managed alternative to RimPy Mo…

Python 852 92 Updated Dec 18, 2025

Tensor computation with WebGPU acceleration

TypeScript 639 23 Updated Jul 25, 2024

Docker daemon API in Rust

Rust 1,159 159 Updated Dec 14, 2025

Efficient RWKV inference engine. RWKV7 7.2B fp16 decoding 10250 tps @ single 5090.

Python 70 18 Updated Dec 18, 2025

A calm, CLI-native way to semantically grep everything, like code, images, pdfs and more.

TypeScript 2,259 99 Updated Dec 17, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 30,886 3,778 Updated Dec 18, 2025

CUDA on non-NVIDIA GPUs

Rust 13,648 876 Updated Dec 16, 2025
Next