Meng, Hengyu airMeng

🇸🇭

I will not serve

32 followers · 53 following

Achievements

x4 x3

sglang Public
Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python Apache License 2.0 Updated Dec 22, 2025
sycl-tla Public
Forked from intel/sycl-tla

SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs

C++ BSD 3-Clause "New" or "Revised" License Updated Nov 7, 2025
ao Public
Forked from pytorch/ao

PyTorch native quantization and sparsity for training and inference

Python BSD 3-Clause "New" or "Revised" License Updated Apr 10, 2025
intel-xpu-backend-for-triton Public
Forked from intel/intel-xpu-backend-for-triton

OpenAI Triton backend for Intel® GPUs

MLIR MIT License Updated Mar 24, 2025
vllm Public
Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python Apache License 2.0 Updated Mar 5, 2025
stable-diffusion.cpp Public
Forked from leejet/stable-diffusion.cpp

Stable Diffusion and Flux in pure C/C++

C++ MIT License Updated Feb 13, 2025
sycl_joint_matrix_kernels Public
Forked from dkhaldi/sycl_joint_matrix_kernels

GEMM performance kernels for Intel GPUs, Nvidia GPUs, and Intel CPUs, written using SYCL joint matrix extension

C++ Updated Dec 25, 2024
pytorch Public
Forked from pytorch/pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python Other Updated Nov 20, 2024
oneDNN Public
Forked from uxlfoundation/oneDNN

oneAPI Deep Neural Network Library (oneDNN)

C++ Apache License 2.0 Updated Nov 19, 2024
torch-xpu-ops Public
Forked from intel/torch-xpu-ops

C++ Apache License 2.0 Updated Nov 15, 2024
ai_tools Public
Forked from jgong5/ai_tools

Python BSD 3-Clause "New" or "Revised" License Updated Nov 4, 2024
intel-extension-for-pytorch Public
Forked from intel/intel-extension-for-pytorch

A Python package for extending the official PyTorch that can easily obtain performance on Intel platform

Python Apache License 2.0 Updated Jun 13, 2024
llama.cpp Public
Forked from ggml-org/llama.cpp

Port of Facebook's LLaMA model in C/C++

C++ MIT License Updated Feb 21, 2024
onnx-mlir Public
Forked from onnx/onnx-mlir

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

C++ Apache License 2.0 Updated Oct 11, 2022
mlir-hello Public
Forked from Lewuathe/mlir-hello

MLIR Sample dialect

Updated Aug 30, 2022
xbyak Public
Forked from herumi/xbyak

a JIT assembler for x86(IA-32)/x64(AMD64, x86-64) MMX/SSE/SSE2/SSE3/SSSE3/SSE4/FPU/AVX/AVX2/AVX-512 by C++ header

C++ BSD 3-Clause "New" or "Revised" License Updated Apr 27, 2022
libxsmm Public
Forked from libxsmm/libxsmm

Library for specialized dense and sparse matrix operations, and deep learning primitives.

C BSD 3-Clause "New" or "Revised" License Updated Mar 21, 2022
sparsednn Public
Forked from marsupialtail/sparsednn

Fast sparse deep learning on CPUs

Python 1 Apache License 2.0 Updated Jan 19, 2022
onnxruntime Public
Forked from microsoft/onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ MIT License Updated Nov 30, 2020
llvm Public
Forked from intel/llvm

Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.

C++ Updated Aug 10, 2020
caffe Public
Forked from intel/caffe

This fork of BVLC/Caffe is dedicated to improving performance of this deep learning framework when running on CPU, in particular Intel® Xeon processors.

C++ Other Updated Apr 21, 2020
models Public
Forked from tensorflow/models

Models and examples built with TensorFlow

Python Apache License 2.0 Updated Jan 15, 2020
MidStateCompare Public

model mid state comparison tools for pytorch

pytorch

Python 2 Updated Jun 4, 2019
mask-time-series-modeling Public

Python 3 1 Updated May 7, 2019
MetaNN Public
Forked from liwei-cpp/MetaNN

C++ Other Updated Apr 30, 2019
pytorch-profiling-tool Public
Forked from zhuwenxi/pytorch-profiling-tool

profiling tools for pytorch

profiler pytorch pytorch-implementation

Python 2 Updated Apr 16, 2019
tsexplain Public
Forked from isaksamsten/tsexplain

Jupyter Notebook Updated Apr 5, 2019
keras-transformer Public
Forked from CyberZHG/keras-transformer

Transformer implemented in Keras

Python MIT License Updated Dec 26, 2018
CUDA_to_SYCL_examples Public
Forked from codeplaysoftware/CUDA_to_SYCL_examples

Example code for the guide

C++ Other Updated Oct 24, 2018
UnsupervisedMT Public
Forked from facebookresearch/UnsupervisedMT

Phrase-Based & Neural Unsupervised Machine Translation

Python Other Updated Sep 21, 2018
