lilohuang
🚀
24/7/365



Reverse engineering notes. Personal reference only. Everything here is a best-guess reconstruction.

Python 48 8 Updated Jun 10, 2026

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 11,320 1,178 Updated Mar 14, 2025

ComfyUI-Downloader

JavaScript 42 10 Updated Jan 19, 2026

nsswitch support for subuid subgid

C 1 2 Updated Apr 15, 2026

Two conversational AI agents switching from English to sound-level protocol after confirming they are both AI agents

TypeScript 4,840 394 Updated Jul 28, 2025

LLM inference in C/C++

C++ 116,362 19,539 Updated Jun 13, 2026

Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.

LLVM 1,487 835 Updated Jun 13, 2026

SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.

C# 4,204 416 Updated Jun 12, 2026

C++ library implementing recent double-word (aka double-double) arithmetics.

C++ 17 2 Updated Mar 3, 2026

Worked example of the process from Python source to CUDA kernel execution with Numba

Jupyter Notebook 45 5 Updated Sep 11, 2024

The CUDA target for Numba

Python 287 67 Updated Jun 13, 2026

Samples of good AI generated CUDA kernels

Python 105 11 Updated May 30, 2025

Effective transpose on Hopper GPU

Cuda 29 3 Updated Sep 6, 2025

Source code that accompanies The CUDA Handbook.

Cuda 575 198 Updated Mar 10, 2026

Doing non-Cartesian MR Imaging has never been so easy.

Python 123 24 Updated Jun 12, 2026

A Python framework for GPU-accelerated simulation, robotics, and machine learning.

Python 6,754 527 Updated Jun 13, 2026

Automatically exported from code.google.com/p/smhasher

C++ 2,875 485 Updated Nov 14, 2024

Multi-GPU CUDA stress test

C++ 2,228 408 Updated May 31, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,861 2,464 Updated Jun 13, 2026

A scheduler for GPU/CPU tasks

C 422 39 Updated Mar 6, 2024

AMD ROCm™ Software - GitHub Home

Shell 6,604 569 Updated Jun 12, 2026

CUDA Core Compute Libraries

C++ 2,378 410 Updated Jun 13, 2026

Hooked CUDA-related dynamic libraries by using automated code generation tools.

C 173 48 Updated Dec 12, 2023

Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction

C++ 2,569 506 Updated Jun 12, 2026

Polygon Clipping, Offsetting & Triangulation in C++, C# and Delphi

C++ 2,363 429 Updated Apr 20, 2026

CUDA 12.2 HMM demos

Cuda 21 8 Updated Jul 26, 2024
LLVM 288 98 Updated Jun 12, 2026

oneAPI DPC++ Library (oneDPL) https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/dpc-library.html

C++ 773 120 Updated Jun 12, 2026