-
composable_kernel Public
Forked from ROCm/composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
C++ Other Updated Nov 26, 2025 -
aiter Public
Forked from ROCm/aiter
AI Tensor Engine for ROCm
Python MIT License Updated Sep 11, 2025 -
sglang Public
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 Updated Jun 12, 2025 -
ray Public
Forked from ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
-
optimum-habana Public
Forked from huggingface/optimum-habana
Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
Python Apache License 2.0 Updated Sep 26, 2024 -
raydp Public
Forked from ray-project/raydp
RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.
Python Apache License 2.0 Updated Nov 7, 2023 -
pytorch Public
Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other Updated Aug 30, 2023 -
jax Public
Forked from jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Python Apache License 2.0 Updated Aug 30, 2023