CUDA Templates for Linear Algebra Subroutines

C++ 1 Updated May 21, 2026

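A good project for campus recruitment (autumn and spring hiring) and internships: build, from scratch and hands-on, a large-model inference framework supporting LLama2/3 and Qwen2.5.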

C++ 548 142 Updated Oct 28, 2025

A lightweight LLaMA-like LLM inference framework based on Triton kernels.

Python 188 32 Updated Jan 5, 2026

OpenAI Triton backend for Intel® GPUs

MLIR 255 100 Updated Jun 12, 2026

SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs

C++ 77 104 Updated Jun 12, 2026

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 1 Updated Jun 12, 2026

How to optimize some algorithms in CUDA.

Cuda 3,076 279 Updated Jun 9, 2026

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 17,266 1,313 Updated Jun 7, 2026

码农的荒岛求生 (A Programmer's Desert Island Survival)

715 119 Updated Dec 16, 2021

Tools to run and parse MKL verbose mode

Python 18 4 Updated Jun 28, 2022

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

Python 1 1 Updated Dec 6, 2022

《Machine Learning Systems: Design and Implementation》 (V2 is launching soon)

TeX 4,808 477 Updated Mar 15, 2026

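AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.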

Jupyter Notebook 16,923 2,396 Updated Sep 3, 2025

Markdown lint tool

Ruby 2,055 245 Updated Jun 5, 2026

The Tensor Algebra SuperOptimizer for Deep Learning

C++ 743 93 Updated Jan 26, 2023

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,756 326 Updated Oct 19, 2024

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,301 332 Updated May 16, 2023

A compiler from Doxygen XML to reStructuredText -- hence, the name. It parses XML databases generated by Doxygen and produces reStructuredText for the Python documentation generator Sphinx.

C++ 310 24 Updated Mar 27, 2026

Static code checker for C++

Python 1,819 304 Updated Jun 2, 2026

oneAPI Deep Neural Network Library (oneDNN)

C++ 4,007 1,147 Updated Jun 12, 2026

Detailed comments for ORB-SLAM2 with trouble-shooting, key formula derivation, and diagrammatic drawing

C++ 1,676 558 Updated May 25, 2023

PyTorch memory tracking code

Python 1,013 152 Updated May 4, 2021

Library targeting Intel Architecture for small, dense or sparse matrix multiplications, and small convolutions.

C 1 1 Updated Oct 4, 2018

Model optimization, model compression, and model pruning

Python 3 Updated Apr 13, 2023

Uniform Manifold Approximation and Projection

Python 8,204 864 Updated Jun 6, 2026

A repository of different Algorithms and Data Structures implemented in many programming languages.

C++ 789 1,003 Updated Jan 9, 2024

This repo consists of data structures and algorithms.

C++ 668 254 Updated Apr 1, 2024

Data Structures and Algorithms implemented In Python, C, C++, Java or any other languages. Aimed to help strengthen the concepts of DSA. Give a Star 🌟 if it helps you.

C++ 272 390 Updated Oct 20, 2022

Algorithms & Data structures in C++.

C++ 5,454 1,523 Updated Aug 1, 2024