:electron:
practicing magic
  • Nanjing University
  • 05:16 (UTC +08:00)

Organizations

@NEUP-Net-Depart @gsoc-cn @unikraft @HMUniversity @ipearworks


Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

Python 6,496 603 Updated Jun 14, 2026

Programmable CUDA/C++ GPU Graph Analytics

C++ 1,092 223 Updated Feb 28, 2026

MLIR-based partitioning system

MLIR 191 37 Updated Jun 14, 2026

PROPELLER: Profile Guided Optimizing Large Scale LLVM-based Relinker

C++ 525 45 Updated May 29, 2026

Training and serving large-scale neural networks with auto parallelization.

Python 3,183 361 Updated Dec 9, 2023

PyTorch extensions for high performance and large scale training.

Python 3,408 298 Updated Apr 26, 2025

Making large AI models cheaper, faster and more accessible

Python 41,398 4,512 Updated May 25, 2026

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,251 365 Updated Aug 14, 2025

LingoDB: A new analytical database system that blurs the lines between databases and compilers.

C++ 309 63 Updated Jun 12, 2026

A cross platform way to express data transformation, relational algebra, standardized record expression and plans.

Python 1,523 191 Updated Jun 14, 2026

Homework showcase from the "losers" (卢瑟), answer walkthroughs, and some C++ knowledge

C++ 751 138 Updated Dec 20, 2025

A list of bugs found by SQLancer

Python 17 6 Updated Jan 30, 2024

Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.

Cuda 418 52 Updated Jan 2, 2025

A roundup of campus recruitment information from internet companies for the class of 2025

845 48 Updated Feb 24, 2025

Material for gpu-mode lectures

Jupyter Notebook 6,176 623 Updated May 9, 2026

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,804 927 Updated Jun 14, 2026

A lightweight memory allocator for hardware-accelerated machine learning

C++ 188 15 Updated Apr 6, 2026

Keep your bugs contained. A platform for studying historical software bugs.

Python 70 12 Updated Jan 8, 2025

"Debian Little Pill Box" (Debian 小药盒): a box design for packaging Debian installation media, together with an introductory instruction leaflet.

TeX 1,874 79 Updated Aug 10, 2025

A massively parallel, optimal functional runtime in Rust

Cuda 11,280 434 Updated Nov 21, 2024

💥💻💥 A data-parallel functional programming language

Haskell 2,739 200 Updated Jun 14, 2026

A massively parallel, high-level programming language

Rust 19,457 479 Updated Jun 3, 2025

Training materials provided by OpenACC.org.

C 98 30 Updated Aug 6, 2024

C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!

C++ 617 164 Updated Jun 19, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

Cuda 2,312 220 Updated Jun 13, 2026

NO TIME TO SLEEP

Python 643 23 Updated May 26, 2024

Tile primitives for speedy kernels

Cuda 3,429 295 Updated May 27, 2026

Hands-On Practical MLIR Tutorial

C++ 792 126 Updated Oct 20, 2023

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10,750 1,792 Updated Jun 14, 2026

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,757 326 Updated Oct 19, 2024