-
MooreThreads, AMD
- Shanghai
-
rocm-libraries Public
Forked from ROCm/rocm-librariessuper repo for rocm libraries
Assembly UpdatedApr 30, 2026 -
llvm-project-amd Public
Forked from ROCm/llvm-projectThis is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit ups…
LLVM Other UpdatedApr 9, 2026 -
ZLUDA---CUDA-Compiler Public
Forked from vosen/ZLUDACUDA on non-NVIDIA GPUs
Rust Apache License 2.0 UpdatedApr 6, 2026 -
BarraCUDA---CUDA-compiler Public
Forked from Zaneham/BarraCUDAOpen-source CUDA compiler targeting multiple GPU architectures. Compiles .cu to AMD and Tenstorrent GPU's
C Apache License 2.0 UpdatedMar 25, 2026 -
-
vortex-riscv-gpgpu Public
Forked from vortexgpgpu/vortexVerilog Apache License 2.0 UpdatedMar 13, 2026 -
CuPBoP_Vortex_CUDA Public
Forked from cupbop/CuPBoP_Vortexvortex backend
C++ MIT License UpdatedMar 13, 2026 -
ray Public
Forked from ray-project/rayRay is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Python Apache License 2.0 UpdatedNov 26, 2025 -
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
MLIR MIT License UpdatedNov 26, 2025 -
Liger-Kernel Public
Forked from linkedin/Liger-KernelEfficient Triton Kernels for LLM Training
Python BSD 2-Clause "Simplified" License UpdatedOct 27, 2025 -
ultralytics Public
Forked from ultralytics/ultralyticsUltralytics YOLO11 🚀
Python GNU Affero General Public License v3.0 UpdatedOct 16, 2025 -
vortex-toolchain-prebuilt Public
Forked from vortexgpgpu/vortex-toolchain-prebuiltUpdatedSep 22, 2025 -
TensorRT-Model-Optimizer Public
Forked from NVIDIA/Model-OptimizerA unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment…
Python Apache License 2.0 UpdatedSep 9, 2025 -
AISystem Public
Forked from Infrasys-AI/AISystemAISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Jupyter Notebook Apache License 2.0 UpdatedSep 3, 2025 -
llama.cpp Public
Forked from ggml-org/llama.cppLLM inference in C/C++
C++ MIT License UpdatedJul 17, 2025 -
mirage-llm-megakernel Public
Forked from mirage-project/mirageMirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
C++ Apache License 2.0 UpdatedJun 22, 2025 -
onnxruntime Public
Forked from microsoft/onnxruntimeONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
C++ MIT License UpdatedJun 19, 2025 -
torch_audio Public
Forked from pytorch/audioData manipulation and transformation for audio signal processing, powered by PyTorch
Python BSD 2-Clause "Simplified" License UpdatedApr 16, 2025 -
torch_vision Public
Forked from pytorch/visionDatasets, Transforms and Models specific to Computer Vision
Python BSD 3-Clause "New" or "Revised" License UpdatedApr 16, 2025 -
accelerated-computing-hub Public
Forked from NVIDIA/accelerated-computing-hubNVIDIA curated collection of educational resources related to general purpose GPU programming.
Jupyter Notebook Other UpdatedMar 15, 2025 -
distributed-llama Public
Forked from b4rtaz/distributed-llamaConnect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.
C++ MIT License UpdatedMar 10, 2025 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedMar 10, 2025 -
ktransformers Public
Forked from kvcache-ai/ktransformersA Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Python Apache License 2.0 UpdatedMar 5, 2025 -
Wan2.1 Public
Forked from Wan-Video/Wan2.1Wan: Open and Advanced Large-Scale Video Generative Models
Python Apache License 2.0 UpdatedMar 4, 2025 -
stable-diffusion.cpp Public
Forked from leejet/stable-diffusion.cppStable Diffusion and Flux in pure C/C++
C++ MIT License UpdatedMar 1, 2025 -
ollama Public
Forked from ollama/ollamaGet up and running with Llama 2, Mistral, and other large language models locally.
Go MIT License UpdatedFeb 27, 2025 -
executorch Public
Forked from pytorch/executorchOn-device AI across mobile, embedded and edge for PyTorch
C++ Other UpdatedFeb 5, 2025 -
AutoGPTQ Public
Forked from AutoGPTQ/AutoGPTQAn easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Python MIT License UpdatedDec 15, 2024 -
LLaMA-MoE-v2 Public
Forked from OpenSparseLLMs/LLaMA-MoE-v2🚀LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
Python Apache License 2.0 UpdatedDec 12, 2024 -
LocalAI Public
Forked from mudler/LocalAI🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…
C++ MIT License UpdatedOct 24, 2024