Skip to content
View i-Pear's full-sized avatar
:electron:
practicing magic
:electron:
practicing magic
  • Nanjing University
  • 22:12 (UTC +08:00)

Organizations

@NEUP-Net-Depart @gsoc-cn @unikraft @HMUniversity @ipearworks

Block or report i-Pear

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 6,527 609 Updated Jun 22, 2026

Programmable CUDA/C++ GPU Graph Analytics

C++ 1,093 223 Updated Feb 28, 2026

MLIR-based partitioning system

MLIR 191 37 Updated Jun 22, 2026

PROPELLER: Profile Guided Optimizing Large Scale LLVM-based Relinker

C++ 528 46 Updated May 29, 2026

Training and serving large-scale neural networks with auto parallelization.

Python 3,181 361 Updated Dec 9, 2023

PyTorch extensions for high performance and large scale training.

Python 3,410 298 Updated Apr 26, 2025

Making large AI models cheaper, faster and more accessible

Python 41,402 4,506 Updated May 25, 2026

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,254 366 Updated Aug 14, 2025

LingoDB: A new analytical database system that blurs the lines between databases and compilers.

C++ 309 63 Updated Jun 22, 2026

A cross platform way to express data transformation, relational algebra, standardized record expression and plans.

Python 1,527 191 Updated Jun 19, 2026

卢瑟们的作业展示,答案讲解,以及一些C++知识

C++ 752 137 Updated Dec 20, 2025

A list of bugs found by SQLancer

Python 17 6 Updated Jan 30, 2024

Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.

Cuda 419 52 Updated Jan 2, 2025

2025届互联网校招信息汇总

844 48 Updated Feb 24, 2025

Material for gpu-mode lectures

Jupyter Notebook 6,198 624 Updated Jun 15, 2026

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,819 933 Updated Jun 22, 2026

A lightweight memory allocator for hardware-accelerated machine learning

C++ 188 15 Updated Apr 6, 2026

Keep your bugs contained. A platform for studying historical software bugs.

Python 70 12 Updated Jan 8, 2025

“Debian 小药盒”,一个用来包装 Debian 安装介质的盒子设计和介绍用的说明书。

TeX 1,883 79 Updated Aug 10, 2025

A massively parallel, optimal functional runtime in Rust

Cuda 11,291 435 Updated Nov 21, 2024

💥💻💥 A data-parallel functional programming language

Haskell 2,742 201 Updated Jun 21, 2026

A massively parallel, high-level programming language

Rust 19,482 481 Updated Jun 3, 2025

Training materials provided by OpenACC.org.

C 98 30 Updated Aug 6, 2024

C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!

C++ 619 166 Updated Jun 19, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

Cuda 2,328 223 Updated Jun 19, 2026

NO TIME TO SLEEP

Python 643 23 Updated May 26, 2024

Tile primitives for speedy kernels

Cuda 3,464 299 Updated Jun 15, 2026

Hands-On Practical MLIR Tutorial

C++ 797 125 Updated Oct 20, 2023

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10,768 1,796 Updated Jun 19, 2026

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,760 326 Updated Oct 19, 2024
Next