Stars
The repo is finally unlocked. Enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.
Cursor IDE extension that automatically re-enables the "OpenAI API Key" toggle when it randomly resets itself.
🏡 Open source home automation that puts local control and privacy first.
Xiaomi Home Integration for Home Assistant
🔰 Home Assistant Operating System
Create Open XML PowerPoint documents in Python
MLIR Inc Previewer - VS Code Extension
Konata is an instruction pipeline visualizer for Onikiri2-Kanata/Gem5-O3PipeView formats. You can download the pre-built binaries from https://github.com/shioyadan/Konata/releases
Fast CUDA matrix multiplication from scratch
Asynchronous semantics for architectural simulation and synthesis.
Unofficial description of the CUDA assembly (SASS) instruction sets.
CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…
This is a beginner-friendly tutorial on MLIR from the perspective of a user of MLIR, not a compiler engineer. This tutorial will introduce why MLIR exists and how it is used to compile code at diff…
An energy-efficient RISC-V floating-point compute cluster.
SST Architectural Simulation Components and Libraries
ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference
Open-source simulator for autonomous driving research.
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
cuTile is a programming model for writing parallel kernels for NVIDIA GPUs
H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference
PyCacheGen: A Highly Configurable Open-Source Generator for Synthesizable Caches
A machine learning accelerator core designed for energy-efficient AI at the edge.
GPUOcelot: A dynamic compilation framework for PTX
PyTorchSim is a comprehensive, fast, and accurate NPU simulation framework
Tracking acceptance rates at global CS/AI conferences. This repository contains metadata for building the OpenAccept main site.