  • Hangzhou, China

Organizations

@llvm


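AIInfra (AI infrastructure) covers the AI system stack, from underlying hardware such as chips up to the upper-layer software stack, that supports training and inference of large AI models.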

Jupyter Notebook 4,667 655 Updated Oct 8, 2025

Google's Operations Research tools

C++ 12,562 2,292 Updated Oct 9, 2025

Stack trace visualizer

Perl 18,807 2,056 Updated Oct 20, 2024

RISC-V Architecture Profiles

Makefile 166 44 Updated Sep 3, 2025

FlatBuffers: Memory Efficient Serialization Library

C++ 24,840 3,404 Updated Sep 25, 2025

《Machine Learning Systems: Design and Implementation》- Chinese Version

TeX 4,661 474 Updated Apr 13, 2024

A programmer's guide to cooking at home (Simplified Chinese only).

Dockerfile 94,655 10,607 Updated Oct 9, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 11,799 1,786 Updated Oct 9, 2025

A new (MLIR based) high-level IR for clang.

LLVM 538 171 Updated Oct 9, 2025

mold: A Modern Linker 🦠

C++ 15,746 517 Updated Oct 9, 2025

Following the RISC-V IME extension standard, and reusing Vector register resources, these instructions can bring more than a tenfold performance improvement to AI applications at a very small hardw…

Makefile 67 7 Updated Aug 14, 2024

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 153,777 13,352 Updated Oct 9, 2025

FFMPEG Assembly Language Lessons

9,298 281 Updated Aug 8, 2025

AddressSanitizer, ThreadSanitizer, MemorySanitizer

C 12,136 1,078 Updated Oct 2, 2025

Enabling PyTorch on XLA Devices (e.g. Google TPU)

C++ 2,685 566 Updated Oct 8, 2025

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 8,554 1,474 Updated Sep 25, 2025

Play with MLIR right in your browser

TypeScript 136 8 Updated May 25, 2023

LLM inference in C/C++

C++ 87,395 13,261 Updated Oct 9, 2025

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

C++ 1,651 639 Updated Oct 9, 2025

Unofficial mirror of sourceware binutils-gdb repository. Updated daily.

C 634 625 Updated Oct 9, 2025

Development repository for the Triton language and compiler

MLIR 17,165 2,289 Updated Oct 9, 2025
MLIR 421 74 Updated Oct 8, 2025

Backward compatible ML compute opset inspired by HLO/MHLO

MLIR 549 156 Updated Oct 7, 2025

A model compilation solution for various hardware

MLIR 450 54 Updated Aug 20, 2025

Documentation for XiangShan

Markdown 426 145 Updated Oct 2, 2025

Spike, a RISC-V ISA Simulator

C 2,852 986 Updated Oct 5, 2025

RISC-V cryptography extensions standardisation work.

C 395 93 Updated Mar 8, 2024

A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch

Python 8,816 1,491 Updated Oct 3, 2025