xinhaoc

🕶️

Focusing

Xinhao Cheng xinhaoc

🕶️

Focusing

36 followers · 15 following

Carnegie Mellon University
Pittsburgh, PA
04:54 (UTC -08:00)
https://xinhaoc.github.io

Achievements

x2 x2

Achievements

x2 x2

Stars

32 results for source starred repositories

Clear filter

reHackable / awesome-reMarkable

A curated list of projects related to the reMarkable tablet

7,223 247 Updated Feb 2, 2026

NVIDIA / cuda-tile

CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…

MLIR 825 60 Updated Jan 14, 2026

infinigence / FlashOverlap

A lightweight design for computation-communication overlap.

Python 219 10 Updated Jan 20, 2026

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,052 846 Updated Feb 8, 2026

MekkCyber / CutlassAcademy

A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS

251 12 Updated May 6, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Python 4,913 698 Updated Feb 8, 2026

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,458 981 Updated Feb 6, 2026

gavinliu6 / Makefile-Tutorial-zh-CN

Makefile 教程

HTML 302 35 Updated Mar 4, 2024

facebookexperimental / triton

Github mirror of trition-lang/triton repo.

MLIR 128 37 Updated Feb 8, 2026

flexflow / flexflow-serve

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

C++ 72 8 Updated Sep 15, 2025

GenseeAI / cognify

Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, DSPy programs for better quality, lower execution latency, and lower execution cost. Also has a simple …

Python 269 32 Updated May 16, 2025

ColfaxResearch / cfx-article-src

C++ 175 34 Updated May 7, 2025

lynnboy / CppCoreGuidelines-zh-CN

Translation of C++ Core Guidelines [https://github.com/isocpp/CppCoreGuidelines] into Simplified Chinese.

2,523 348 Updated Dec 22, 2025

mirage-project / mirage

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 2,120 172 Updated Jan 29, 2026

timovv / notion-website-template

Make a personal website using Notion and GitHub Pages

Shell 142 66 Updated Oct 27, 2023

NVIDIA / cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,241 1,664 Updated Feb 4, 2026

jiazhihao / attention_superoptimizer

An Attention Superoptimizer

C++ 22 Updated Jan 20, 2025

ml-explore / mlx

MLX: An array framework for Apple silicon

C++ 23,836 1,500 Updated Feb 8, 2026

jonbarron / jonbarron.github.io

HTML 3,428 2,900 Updated Jan 17, 2026

Timothyxxx / RetrivalLMPapers

Paper collections of retrieval-based (augmented) language model.

232 12 Updated May 24, 2024

lambda7xx / awesome-AI-system

paper and its code for AI System

347 23 Updated Dec 13, 2025

mlc-ai / tokenizers-cpp

Universal cross-platform tokenizers binding to HF and sentencepiece

C++ 451 111 Updated Jan 23, 2026

Tony-Tan / CUDA_Freshman

Cuda 2,686 503 Updated Jan 16, 2024

flexflow / flexflow-train

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,859 248 Updated Feb 7, 2026

facebookresearch / segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 53,383 6,229 Updated Sep 18, 2024

aermin / blog

📝 My blog / notes

245 34 Updated Sep 22, 2022

davidbau / how-to-read-pytorch

Quick, visual, principled introduction to pytorch code through five colab notebooks.

Jupyter Notebook 460 71 Updated Jan 13, 2025

datawhalechina / learn-nlp-with-transformers

we want to create a repo to illustrate usage of transformers in chinese

Shell 3,099 497 Updated Aug 18, 2024

Helsinki-NLP / Tatoeba-Challenge

Makefile 845 90 Updated Aug 20, 2024

sail-sg / envpool

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

C++ 1,265 128 Updated Aug 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xinhao Cheng xinhaoc

Achievements

Achievements

Block or report xinhaoc

Stars

reHackable / awesome-reMarkable

NVIDIA / cuda-tile

infinigence / FlashOverlap

ai-dynamo / dynamo

MekkCyber / CutlassAcademy

flashinfer-ai / flashinfer

deepseek-ai / FlashMLA

gavinliu6 / Makefile-Tutorial-zh-CN

facebookexperimental / triton

flexflow / flexflow-serve

GenseeAI / cognify

ColfaxResearch / cfx-article-src

lynnboy / CppCoreGuidelines-zh-CN

mirage-project / mirage

timovv / notion-website-template

NVIDIA / cutlass

jiazhihao / attention_superoptimizer

ml-explore / mlx

jonbarron / jonbarron.github.io

Timothyxxx / RetrivalLMPapers

lambda7xx / awesome-AI-system

mlc-ai / tokenizers-cpp

Tony-Tan / CUDA_Freshman

flexflow / flexflow-train

facebookresearch / segment-anything

aermin / blog

davidbau / how-to-read-pytorch

datawhalechina / learn-nlp-with-transformers

Helsinki-NLP / Tatoeba-Challenge

sail-sg / envpool