Lists (3)
Sort Name ascending (A-Z)
Stars
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
OpenCL API, OpenCL C, Extensions, SPIR-V Environment Specs, Ref page, and C++ for OpenCL doc sources.
A guide to help developers get up and running quickly with the OpenCL programming framework
Integer Set Library (source repository: http://repo.or.cz/w/isl.git)
FlashMLA: Efficient Multi-head Latent Attention Kernels
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Simple development server with live-reload capability for Julia.
OpenCL integration for Python, plus shiny features
A code generator for array-based code on CPUs and GPUs
Repository reproducing CPU and GPU semidiscretization performance benchmarks presented at JuliaCon 2025
Package for writing high-level code for parallel high-performance stencil computations that can be deployed on both GPUs and CPUs
A benchmarking framework for the Julia language
High Order Hex-Quad Mesh (HOHQMesh) package to automatically generate all-quadrilateral meshes with high order boundary information.
CUDA integration for Python, plus shiny features
Tools for easily handling objects like arrays of arrays and deeper nestings in scientific machine learning (SciML) and other applications
Portable and vendor neutral framework for parallel programming on heterogeneous platforms.
High-performance automatic differentiation of LLVM and MLIR.
Benchmarks of approximate nearest neighbor libraries in Python
A library for efficient similarity search and clustering of dense vectors.