Stars
An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
A high-throughput and memory-efficient inference and serving engine for LLMs
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
Development repository for the Triton language and compiler
muRISCV-NN is a collection of efficient deep learning kernels for embedded platforms and microcontrollers.
A retargetable MLIR-based machine learning compiler and runtime toolkit.
A machine learning compiler for GPUs, CPUs, and ML accelerators
A framework that supports executing unmodified CUDA source code on non-NVIDIA devices.
NVIDIA-curated collection of educational resources related to general-purpose GPU programming.
GPGPU processor supporting the RISC-V Vector (V) extension, developed with Chisel HDL
A course on LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Open-source super AI assistant & Agent Harness. Plans tasks, runs tools and skills, self-evolves with memory and knowledge. Multi-model, multi-channel. Lightweight, extensible, one-line install. (f…
qrb_ros_nn_inference is a ROS 2 package for running neural network model inference, providing AI-based perception for robotics applications.
CV-CUDA™ is an open-source, GPU-accelerated library for cloud-scale image processing and computer vision.
Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.
Free online AI resume editor. The only official website is https://magicv.art
AI agents running research on single-GPU nanochat training automatically
QRB ROS Transport is designed for zero-copy transport of ROS messages on Qualcomm robotics platforms.
dmabuf_transport is a package for zero-copy transport of ROS messages using Linux dma-buf file descriptors.
Build and run containers leveraging NVIDIA GPUs
omo/lazycodex: The coding agent for tokenmaxxers; the one and only agent harness for complex codebases. For your Codex, for your OpenCode
NVIDIA Isaac Transport for ROS package for hardware-acceleration friendly movement of messages
Python tool for converting files and office documents to Markdown.