GPU accelerated decision optimization

Cuda 532 88 Updated Nov 5, 2025

Portable GPU Programming

C++ 22 9 Updated Oct 31, 2025

Welcome to the official repository of SINQ! A novel, fast and high-quality quantization method designed to make any Large Language Model smaller while preserving accuracy.

Python 564 43 Updated Oct 31, 2025

Measure and optimize the energy consumption of your AI applications!

Python 307 37 Updated Oct 25, 2025

Efficient AI-Enhanced 5G PUSCH Receiver

Python 1 Updated Aug 5, 2025

Efficient AI-Enhanced 5G PUSCH Receiver

Python 3 1 Updated Oct 19, 2025

Implementation of Axial attention - attending to multi-dimensional data efficiently

Python 388 33 Updated Aug 26, 2021

A curated list of materials on AI efficiency

183 16 Updated Nov 1, 2025

MATLAB 3 Updated May 20, 2025

Python 4 Updated Sep 10, 2025

Python code for "Probabilistic Machine learning" book by Kevin Murphy

Jupyter Notebook 6,937 1,592 Updated Sep 23, 2025

Efficient Knowledge Injection in LLMs via Self-Distillation (TMLR)

Python 6 1 Updated Aug 5, 2025

Fast low-bit matmul kernels in Triton

Python 392 29 Updated Oct 26, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 77,952 11,506 Updated Nov 3, 2025

An official implementation of "Scheduling Weight Transitions for Quantization-Aware Training" (ICCV 2025) in PyTorch.

Python 3 1 Updated Aug 21, 2025

Neural network quantization for research and prototyping

Python 38 Updated Nov 3, 2025

Real-Time Inference of 5G NR Multi-user MIMO Neural Receivers

Jupyter Notebook 73 15 Updated Apr 15, 2025

Complete solutions to the Programming Massively Parallel Processors Edition 4

Jupyter Notebook 563 74 Updated Jun 18, 2025

Source code of the Paper "Sparse Bayesian Generative Modeling for Compressive Sensing" (NeurIPS 24)

Python 8 1 Updated May 30, 2025

Python 1 Updated Jul 29, 2025

Code for the book "The Elements of Differentiable Programming".

Python 273 23 Updated Jun 21, 2025

Inference Llama 2 in one file of pure C

C 18,912 2,400 Updated Aug 6, 2024

Sionna Research Kit: A GPU-Accelerated Research Platform for AI-RAN

Jupyter Notebook 53 10 Updated Oct 15, 2025

[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers

Python 440 50 Updated Dec 22, 2023

Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing

Jupyter Notebook 60 8 Updated Jan 8, 2025

CS433 project. Implement Post-training Quantization method ACIQ and ADAROUND.

Python 9 Updated Mar 24, 2023

Code for the paper "Cauchy-Schwarz Regularizers" from ICLR 2025

Python 4 1 Updated Feb 28, 2025