View Cjkkkk's full-sized avatar
🏠
coding...


cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 2,083 140 Updated Jun 23, 2026

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Python 447 66 Updated Jan 5, 2026

pprof is a tool for visualization and analysis of profiling data

Go 9,213 663 Updated Jun 4, 2026

Puzzles for learning Triton

Jupyter Notebook 2,499 240 Updated Apr 1, 2026

JAX-Toolbox

Python 415 76 Updated Jun 23, 2026
Python 356 31 Updated Apr 13, 2026

A simple, performant and scalable Jax LLM!

Python 2,334 542 Updated Jun 23, 2026

Development repository for the Triton language and compiler

MLIR 19,510 2,958 Updated Jun 23, 2026
C++ 9 1 Updated Oct 31, 2022

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 556 72 Updated Jun 4, 2026

Universal LLM Deployment Engine with ML Compilation

Python 22,848 2,070 Updated May 11, 2026

A VM That is Dynamic and Fast

C 1,662 60 Updated Jun 8, 2025

A machine learning framework project motivated by CMU-10414

Python 1 Updated Dec 16, 2022

a language for fast, portable data-parallel computation

C++ 1 Updated Nov 10, 2025

Container plugin for Slurm Workload Manager

C 452 43 Updated May 12, 2026

A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.

Shell 968 130 Updated Jun 9, 2026

Summer 2026 software engineering, data science, AI, quant, product management, and hardware internship postings. Updated daily by Simplify and Pitt CSC.

Python 45,025 3,178 Updated Jun 23, 2026

Various translations of OSTEP can be found here. Help the cause and contribute!

3,071 512 Updated Jan 20, 2025

MIT 6.824 (Distributed Systems) labs in Go

Go 234 63 Updated Feb 22, 2021

A library for replicating your python class between multiple servers, based on raft protocol

Python 750 119 Updated Mar 17, 2026

HIPIFY: Convert CUDA to Portable C++ Code

C++ 705 107 Updated Jun 23, 2026

A GPU benchmark suite for assessing on-chip GPU memory bandwidth

C++ 113 28 Updated Aug 12, 2017

AI education materials for Chinese students, teachers and IT professionals.

HTML 14,069 2,937 Updated May 16, 2024

portion, a Python library providing data structure and operations for intervals.

Python 523 40 Updated Jun 14, 2026

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

1,677 180 Updated Jan 21, 2026

The reference implementation of the Linux FUSE (Filesystem in Userspace) interface

C 6,076 1,276 Updated Jun 23, 2026

TensorFlow code and pre-trained models for BERT

Python 40,038 9,700 Updated Jul 23, 2024

This is the top-level repository for the Accel-Sim framework.

Python 613 220 Updated Mar 24, 2026

A polyhedral compiler for expressing fast and portable data parallel algorithms

C++ 960 137 Updated Nov 20, 2024

Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet…

Python 4,773 1,206 Updated Jul 15, 2024