Stars
In-depth exploratory performance analysis and benchmarking of the QEMU emulator using the TCG JIT in both its Linux user and system modes.
📚 A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc. 🎉
Box64 - Linux Userspace x86_64 Emulator with a twist, targeted at ARM64, RV64 and LoongArch Linux devices
Proofs for the paper "Risotto: A Dynamic Binary Translator for Weak Memory Model Architectures"
A high performance LLVM-based dynamic binary instrumentation framework
Ocolos is the first open-sourced online code layout optimization system for unmodified applications written in unmanaged languages.
A benchmark designed specifically for deep learning operators
A group of students who are interested in compilers and want to improve together.
Benchmark Framework for Buddy Projects
An MLIR-based compiler framework that bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
Elixir is a dynamic, functional language for building scalable and maintainable applications
Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training
Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A guide that explains how high-level programming language constructs are mapped to the LLVM intermediate language.
Deeplang is a new language for IoT device programming.
Automatic Schedule Exploration and Optimization Framework for Tensor Computations
LaTeX Thesis Template for the University of Chinese Academy of Sciences