Skip to content
View lijiansong's full-sized avatar
🛠️
造轮子...
🛠️
造轮子...

Organizations

@googol-lab @SpMV-Opt

Block or report lijiansong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The btbench benchmark suite

Go 4 2 Updated Oct 30, 2024

In-depth exploratory performance analysis and benchmarking of the QEMU emulator using the TCG JIT in both its Linux user and system modes.

C 19 5 Updated May 10, 2024

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 5,132 359 Updated Apr 9, 2026

Box64 - Linux Userspace x86_64 Emulator with a twist, targeted at ARM64, RV64 and LoongArch Linux devices

C 5,331 421 Updated Apr 9, 2026

Proofs for the paper "Risotto: A Dynamic Binary Translator for Weak Memory Model Architectures"

Agda 5 Updated Sep 13, 2022

A high performance LLVM-based dynamic binary instrumentation framework

C 286 43 Updated Jun 7, 2024

Rule-based Dynamic Binary Translator

C 12 4 Updated Sep 15, 2020

A series of posts about QEMU internals:

1,523 161 Updated Nov 3, 2023

The Modular Platform (includes MAX & Mojo)

Mojo 25,852 2,797 Updated Apr 9, 2026

Lift machine code to performant LLVM IR

C++ 499 45 Updated Jun 17, 2024

Ocolos is the first open-sourced online code layout optimization system for unmodified applications written in unmanaged languages.

C++ 53 16 Updated Apr 9, 2026

A benchmark suited especially for deep learning operators

Python 42 5 Updated Feb 13, 2023

A group of students who are interested in Compilers, and they want to improve themselves together.

25 Updated Aug 23, 2022

Benchmark Framework for Buddy Projects

C 54 49 Updated Oct 31, 2025

An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).

Python 706 247 Updated Apr 9, 2026

Elixir is a dynamic, functional language for building scalable and maintainable applications

Elixir 26,412 3,481 Updated Apr 9, 2026

Erlang/OTP

Erlang 12,107 3,060 Updated Apr 9, 2026

教科书《计算机体系结构基础》(胡伟武等,第三版)的开源版本

TeX 3,330 316 Updated Nov 20, 2025

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,872 250 Updated Mar 25, 2026

AI and Memory Wall

226 26 Updated Mar 23, 2024

Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616

Python 133 18 Updated Jul 6, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,027 4,776 Updated Apr 9, 2026

Benchmark scripts for TVM

Python 74 28 Updated Mar 15, 2022

A guide that explains how high level programming language constructs are mapped to the LLVM intermediate language.

LLVM 653 64 Updated Jan 25, 2026

Deeplang is a new language for IoT device programming.

82 15 Updated Feb 22, 2023

Distributed Multi-GPU GNN Framework

C++ 36 10 Updated Jun 26, 2020

DaCe - Data Centric Parallel Programming

Python 582 155 Updated Mar 30, 2026

Automatic Schedule Exploration and Optimization Framework for Tensor Computations

Python 184 32 Updated Apr 25, 2022

LaTeX Thesis Template for the University of Chinese Academy of Sciences

TeX 3,812 943 Updated Feb 29, 2024
Next