Skip to content
View zhiics's full-sized avatar

Organizations

@apache @dmlc

Block or report zhiics

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,681 384 Updated Sep 17, 2025

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,706 3,667 Updated Oct 8, 2025

Productive, portable, and performant GPU programming in Python.

C++ 27,585 2,364 Updated Oct 6, 2025

MSVC's implementation of the C++ Standard Library.

C++ 10,815 1,583 Updated Oct 9, 2025

Pyodide is a Python distribution for the browser and Node.js based on WebAssembly

Python 13,758 976 Updated Oct 8, 2025

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 16,039 3,873 Updated Oct 9, 2025

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,604 2,261 Updated Sep 23, 2025

A high performance and generic framework for distributed DNN training

Python 3,704 494 Updated Oct 3, 2023

Machine learning, in numpy

Python 16,157 3,795 Updated Oct 29, 2023

A General-purpose Task-parallel Programming System using Modern C++

C++ 11,279 1,316 Updated Oct 8, 2025

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

C++ 20,827 6,751 Updated Oct 25, 2023

TVM integration into PyTorch

C++ 454 64 Updated Jan 15, 2020

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 13,216 2,075 Updated Oct 9, 2025

Low-precision matrix multiplication

C++ 1,815 457 Updated Jan 29, 2024

An experimental ahead of time compiler for Relay.

Python 50 8 Updated Apr 21, 2020

Nodejs extension host for vim & neovim, load extensions like VSCode and host language servers.

TypeScript 25,010 964 Updated Sep 26, 2025

"Multi-Level Intermediate Representation" Compiler Infrastructure

1,758 258 Updated Apr 22, 2021

Optional static typing for Python

Python 19,872 3,025 Updated Oct 9, 2025

C++ python bytecode disassembler and decompiler

C++ 4,022 760 Updated Aug 30, 2025

Repository for the book "Crafting Interpreters"

HTML 10,134 1,183 Updated Aug 7, 2024

An open-source NLP research library, built on PyTorch.

Python 11,879 2,239 Updated Nov 22, 2022

Fast serialization framework for C

C 225 21 Updated Aug 26, 2017

Lightweight profiler library for c++

C++ 2,307 202 Updated Jul 15, 2025

A polyhedral compiler for expressing fast and portable data parallel algorithms

C++ 951 136 Updated Nov 20, 2024

MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.

C++ 5,027 826 Updated Jun 17, 2024

The Python programming language

Python 69,235 33,043 Updated Oct 9, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 33,652 3,195 Updated Oct 9, 2025

A technical report on convolution arithmetic in the context of deep learning

TeX 14,503 2,306 Updated Jun 8, 2023

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,296 2,128 Updated Aug 18, 2025

Bonus materials, exercises, and example projects for our Python tutorials

HTML 5,007 5,323 Updated Oct 9, 2025
Next