-
sglang Public
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 Updated Dec 22, 2025 -
sycl-tla Public
Forked from intel/sycl-tla
SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs
C++ BSD 3-Clause "New" or "Revised" License Updated Nov 7, 2025 -
ao Public
Forked from pytorch/ao
PyTorch native quantization and sparsity for training and inference
Python BSD 3-Clause "New" or "Revised" License Updated Apr 10, 2025 -
intel-xpu-backend-for-triton Public
Forked from intel/intel-xpu-backend-for-triton
OpenAI Triton backend for Intel® GPUs
MLIR MIT License Updated Mar 24, 2025 -
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Mar 5, 2025 -
stable-diffusion.cpp Public
Forked from leejet/stable-diffusion.cpp
Stable Diffusion and Flux in pure C/C++
C++ MIT License Updated Feb 13, 2025 -
sycl_joint_matrix_kernels Public
Forked from dkhaldi/sycl_joint_matrix_kernels
GEMM performance kernels for Intel GPUs, Nvidia GPUs, and Intel CPUs, written using SYCL joint matrix extension
C++ Updated Dec 25, 2024 -
pytorch Public
Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other Updated Nov 20, 2024 -
oneDNN Public
Forked from uxlfoundation/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
C++ Apache License 2.0 Updated Nov 19, 2024 -
ai_tools Public
Forked from jgong5/ai_tools
Python BSD 3-Clause "New" or "Revised" License Updated Nov 4, 2024 -
intel-extension-for-pytorch Public
Forked from intel/intel-extension-for-pytorch
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
Python Apache License 2.0 Updated Jun 13, 2024 -
llama.cpp Public
Forked from ggml-org/llama.cpp
Port of Facebook's LLaMA model in C/C++
C++ MIT License Updated Feb 21, 2024 -
onnx-mlir Public
Forked from onnx/onnx-mlir
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
C++ Apache License 2.0 Updated Oct 11, 2022 -
xbyak Public
Forked from herumi/xbyak
a JIT assembler for x86(IA-32)/x64(AMD64, x86-64) MMX/SSE/SSE2/SSE3/SSSE3/SSE4/FPU/AVX/AVX2/AVX-512 by C++ header
C++ BSD 3-Clause "New" or "Revised" License Updated Apr 27, 2022 -
libxsmm Public
Forked from libxsmm/libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
C BSD 3-Clause "New" or "Revised" License Updated Mar 21, 2022 -
sparsednn Public
Forked from marsupialtail/sparsednn
Fast sparse deep learning on CPUs
-
onnxruntime Public
Forked from microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
C++ MIT License Updated Nov 30, 2020 -
llvm Public
Forked from intel/llvm
Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.
C++ Updated Aug 10, 2020 -
caffe Public
Forked from intel/caffe
This fork of BVLC/Caffe is dedicated to improving performance of this deep learning framework when running on CPU, in particular Intel® Xeon processors.
C++ Other Updated Apr 21, 2020 -
models Public
Forked from tensorflow/models
Models and examples built with TensorFlow
Python Apache License 2.0 Updated Jan 15, 2020 -
MidStateCompare Public
model mid state comparison tools for pytorch
-
pytorch-profiling-tool Public
Forked from zhuwenxi/pytorch-profiling-tool
profiling tools for pytorch
-
keras-transformer Public
Forked from CyberZHG/keras-transformer
Transformer implemented in Keras
Python MIT License Updated Dec 26, 2018 -
CUDA_to_SYCL_examples Public
Forked from codeplaysoftware/CUDA_to_SYCL_examples
Example code for the guide
C++ Other Updated Oct 24, 2018 -
UnsupervisedMT Public
Forked from facebookresearch/UnsupervisedMT
Phrase-Based & Neural Unsupervised Machine Translation
Python Other Updated Sep 21, 2018