Stars
A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.
Goal: Enable awesome tooling for Bazel users of the C language family.
OLMoE: Open Mixture-of-Experts Language Models
Curated collection of papers in machine learning systems
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
Machine learning compiler based on MLIR for Sophgo TPU.
Implementation of Reliable Feature-Line Driven Quad-Remeshing
QuadriFlow: A Scalable and Robust Method for Quadrangulation
Development repository for the Triton-Linalg conversion
FlagGems is an operator library for large language models implemented in the Triton Language.
A collection of memory-efficient attention operators implemented in the Triton language.
jax-triton contains integrations between JAX and OpenAI Triton
Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.
Power-efficient dashboard for Kindle 4 NT devices
OpenAI Triton backend for Intel® GPUs
An open-source, efficient deep learning framework/compiler, written in Python.
Trixi.jl: Adaptive high-order numerical simulations of conservation laws in Julia
High Order Hex-Quad Mesh (HOHQMesh) package to automatically generate all-quadrilateral meshes with high-order boundary information.