CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

Cuda 441 28 Updated Mar 30, 2026

A fast framework for writing baseline compiler back-ends in C++

LLVM 653 36 Updated Mar 31, 2026

Lightning fast C++/CUDA neural network framework

C++ 4,499 572 Updated Apr 21, 2026

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 33,096 4,185 Updated Jun 16, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 59,737 10,303 Updated Nov 12, 2025

Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.

Python 487 26 Updated Jun 11, 2026

An extremely fast Python package and project manager, written in Rust.

Rust 86,462 3,210 Updated Jun 16, 2026

Open source, self-hosted omnichannel customer support desk. Live chat, email, and more in a single binary.

Go 2,571 197 Updated Jun 16, 2026

Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.

Jupyter Notebook 18,170 4,989 Updated Feb 11, 2026

HLSL Specifications

TeX 222 56 Updated Jun 8, 2026

Stanford NLP Python library for Representation Finetuning (ReFT)

Python 1,571 134 Updated Mar 5, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 29,081 6,557 Updated Jun 16, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 83,073 18,122 Updated Jun 16, 2026

An open access book on scientific visualization using python and matplotlib

Python 11,321 1,012 Updated Jan 4, 2026

🎨 A succinct matplotlib wrapper for making beautiful, publication-quality graphics

Python 1,154 95 Updated Feb 27, 2025

Matplotlib styles for scientific plotting

Python 8,981 815 Updated Feb 25, 2026

Exploring the scalable matrix extension of the Apple M4 processor

C 231 13 Updated Nov 7, 2024

16-voice polyphonic Autoregressive Algorithmic Synthesizer

32 2 Updated Apr 16, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,906 1,911 Updated Jun 16, 2026

Learn how to develop, deploy and iterate on production-grade ML applications.

Jupyter Notebook 48,156 7,577 Updated Mar 4, 2026

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 66,942 6,709 Updated Jan 22, 2026

This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)

Python 6,460 596 Updated Apr 22, 2024

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

71,783 7,017 Updated Jan 4, 2026

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

29,787 3,949 Updated Jul 18, 2024

A massively parallel, optimal functional runtime in Rust

Cuda 11,284 434 Updated Nov 21, 2024

A massively parallel, high-level programming language

Rust 19,470 480 Updated Jun 3, 2025

Development repository for the Triton language and compiler

MLIR 19,458 2,938 Updated Jun 16, 2026

Spike, a RISC-V ISA Simulator

C 3,145 1,071 Updated Jun 9, 2026

Open MPI main development repository

C 2,600 962 Updated Jun 15, 2026

LLM training in simple, raw C/CUDA

Cuda 30,236 3,648 Updated Jun 26, 2025