Organizations

@apple


CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

Cuda 251 19 Updated Dec 15, 2025

A fast framework for writing baseline compiler back-ends in C++

LLVM 596 30 Updated Dec 23, 2025

Lightning fast C++/CUDA neural network framework

C++ 4,362 538 Updated Dec 14, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 30,926 3,793 Updated Dec 25, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,381 8,612 Updated Nov 12, 2025

Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.

Python 433 15 Updated Dec 16, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 75,590 2,381 Updated Dec 24, 2025

Modern, open source, self-hosted customer support desk. Single binary app.

Go 1,857 108 Updated Dec 24, 2025

Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.

Jupyter Notebook 16,621 4,580 Updated Jan 7, 2025

HLSL Specifications

TeX 208 54 Updated Dec 18, 2025

Stanford NLP Python library for Representation Finetuning (ReFT)

Python 1,547 130 Updated Feb 6, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,943 3,861 Updated Dec 25, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,116 12,169 Updated Dec 25, 2025

An open access book on scientific visualization using python and matplotlib

Python 11,137 1,014 Updated Jan 22, 2024

🎨 A succinct matplotlib wrapper for making beautiful, publication-quality graphics

Python 1,144 98 Updated Feb 27, 2025

Matplotlib styles for scientific plotting

Python 8,464 783 Updated Nov 20, 2025

Exploring the scalable matrix extension of the Apple M4 processor

C 213 12 Updated Nov 7, 2024

16-voice polyphonic Autoregressive Algorithmic Synthesizer

30 1 Updated Dec 13, 2025

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,011 1,594 Updated Dec 24, 2025

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Jupyter Notebook 45,254 7,078 Updated Aug 18, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 65,005 6,566 Updated Nov 11, 2025

This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)

Python 6,251 566 Updated Apr 22, 2024

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

67,338 6,719 Updated Dec 6, 2025

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

28,562 3,830 Updated Jul 18, 2024

A massively parallel, optimal functional runtime in Rust

Cuda 11,181 427 Updated Nov 21, 2024

A massively parallel, high-level programming language

Rust 19,122 470 Updated Jun 3, 2025

Development repository for the Triton language and compiler

MLIR 17,923 2,467 Updated Dec 25, 2025

Spike, a RISC-V ISA Simulator

C 2,970 1,006 Updated Dec 23, 2025

Open MPI main development repository

C 2,494 937 Updated Dec 24, 2025

LLM training in simple, raw C/CUDA

Cuda 28,459 3,338 Updated Jun 26, 2025