Meng, Hengyu airMeng

🇸🇭

I will not serve

34 followers · 53 following

Achievements

x4 x3

Stars

richards199999 / Thinking-Claude

Let your Claude able to think

TypeScript 16,768 1,980 Updated Nov 4, 2025

intel / auto-round

🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality degradation across Weight-Only Quantization, MXFP4, NVFP4, GGUF, and adaptive schemes.

Python 844 77 Updated Feb 4, 2026

intel / intel-npu-acceleration-library

Intel® NPU Acceleration Library

Python 703 82 Updated Apr 24, 2025

intel / neural-speed

An innovative library for efficient LLM inference via low-bit quantization

C++ 352 39 Updated Aug 30, 2024

intel / xetla

C++ 61 20 Updated Dec 18, 2024

BBuf / how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Cuda 2,815 256 Updated Jan 31, 2026

intel / intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python 2,174 216 Updated Oct 8, 2024

intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python 2,581 295 Updated Feb 4, 2026

sol-prog / x86-64-minimal-JIT-compiler-Cpp

Writing a minimal x86-64 JIT compiler in C++

C++ 106 17 Updated Apr 28, 2018

huggingface / optimum-intel

🤗 Optimum Intel: Accelerate inference with Intel optimization tools

Jupyter Notebook 532 185 Updated Feb 3, 2026

intel / intel-extension-for-tensorflow

Intel® Extension for TensorFlow*

C++ 349 45 Updated Oct 29, 2025

mlc-ai / notebooks

Jupyter Notebook 221 81 Updated Nov 22, 2024

Lewuathe / mlir-hello

MLIR Sample dialect

C++ 136 36 Updated Dec 23, 2025

polymage-labs / mlirx

MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com

38 9 Updated Dec 1, 2023

deepsea-inria / pasl

Parallel Algorithm Scheduling Library

C++ 105 20 Updated Jul 24, 2017

pigirons / sgemm_hsw

This is an implementation of sgemm_kernel on L1d cache.

Assembly 233 33 Updated Feb 26, 2024

sampsyo / bril

an educational compiler intermediate representation

Rust 732 323 Updated Jan 5, 2026

OpenPPL / ppl.nn

A primitive library for neural network

C++ 1,369 223 Updated Nov 24, 2024

merrymercy / awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,725 324 Updated Oct 19, 2024

marsupialtail / sparsednn

Fast sparse deep learning on CPUs

Python 56 8 Updated Sep 28, 2022

NervanaSystems / maxas

Assembler for NVIDIA Maxwell architecture

Sass 1,060 171 Updated Jan 3, 2023

oneapi-src / oneAPI-samples

Samples for Intel® oneAPI Toolkits

C++ 1,125 744 Updated Jan 29, 2026

jmmartinez / easy-just-in-time

LLVM Optimization to extract a function, embedded in its intermediate representation in the binary, and execute it using the LLVM Just-In-Time compiler.

C++ 531 31 Updated May 15, 2021

Talmaj / onnx2pytorch

Transform ONNX model to PyTorch representation

Python 345 70 Updated Nov 4, 2025

jeffhammond / dpcpp-tutorial

Intel Data Parallel C++ (and SYCL 2020) Tutorial.

C++ 95 16 Updated Dec 15, 2021

bytedance / lightseq

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,305 335 Updated May 16, 2023

onnx / onnx-mlir

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

C++ 974 400 Updated Feb 4, 2026

chunyuan-w / ipex_verbose

ipex verbose toolkit

Python 2 Updated Mar 10, 2022

gangiman / PySparseConvNet

Python Framework for sparse neural networks

Cuda 19 5 Updated Apr 28, 2017

matazure / mtensor

a c++/cuda template library for tensor lazy evaluation

C++ 164 38 Updated May 8, 2023
