❤️
Open Source


The best-benchmarked open-source AI memory system. And it's free.

Python 47,420 6,192 Updated Apr 17, 2026

Research on Coding Agents

11,667 19,741 Updated Apr 1, 2026

XLS: Accelerated HW Synthesis

C++ 1,471 226 Updated Apr 17, 2026

AI agents running research on single-GPU nanochat training automatically

Python 73,723 10,733 Updated Mar 26, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,497 508 Updated Apr 17, 2026

Open Machine Learning Compiler Framework

Python 13,277 3,857 Updated Apr 17, 2026

AI Tensor Engine for ROCm

Python 406 281 Updated Apr 17, 2026

CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…

C++ 940 78 Updated Apr 1, 2026

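A personal Chinese translation of "C++17 - The Complete Guide", intended for study and discussion only; it will be removed on request if it infringes any rights.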

TeX 1,755 275 Updated Mar 23, 2026

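Newly compiled in 2021: C++ learning resources, including new features of C++ 11 / 14 / 17 / 20 / 23, beginner tutorials, recommended books, quality articles, study notes, and tutorial videos.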

C++ 6,291 1,254 Updated Jun 18, 2025

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,985 5,423 Updated Apr 17, 2026

A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience

TypeScript 111,397 8,119 Updated Apr 17, 2026

This repository contains companion software for the Colfax Research paper "Categorical Foundations for CuTe Layouts".

Python 124 4 Updated Sep 24, 2025

Tile primitives for speedy kernels

Cuda 3,321 278 Updated Apr 8, 2026

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,422 772 Updated Apr 17, 2026

PyTorch emulation library for Microscaling (MX)-compatible data formats

Python 353 48 Updated Jun 18, 2025

The Modular Platform (includes MAX & Mojo)

Mojo 25,892 2,800 Updated Apr 17, 2026

A Quirky Assortment of CuTe Kernels

Python 931 112 Updated Apr 16, 2026

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 12,214 1,139 Updated Aug 18, 2024

The official repository for the gem5 computer-system architecture simulator.

C++ 2,574 1,776 Updated Apr 17, 2026

NumPy and SciPy on Multi-Node Multi-GPU systems

Python 968 87 Updated Apr 16, 2026

Approaching (Almost) Any Machine Learning Problem

8,308 1,124 Updated Mar 25, 2023

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 121,268 13,311 Updated Apr 17, 2026

GNU toolchain for RISC-V, including GCC

C 4,448 1,379 Updated Apr 5, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,558 1,008 Updated Apr 7, 2026

Verilator open-source SystemVerilog simulator and lint system

SystemVerilog 3,539 795 Updated Apr 17, 2026

Icarus Verilog

C++ 3,407 594 Updated Apr 15, 2026

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

C 9,087 2,319 Updated Mar 30, 2026

Lex yacc tutorial

Lex 1 Updated Oct 15, 2020

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,594 1,795 Updated Apr 17, 2026