Chtholly Chtholly-Boss

🫡

Chtholly is the happiest girl in the world !!!

14 followers · 26 following

HITSZ
GuangDong Shenzhen

Highlights

Pro

Lists (7)

Awesome-X

Courses

Courses or Tutorials

14 repositories

Cpp Header Libs

Header-Only Libraries

GPGPU Programming

23 repositories

Resources

Collection of open source resources

Tiny/Nano-X

Tiny Projects for educational purpose

Tools

Simple but quite good tools

24 repositories

Starred repositories

apache / tvm-ffi

Open ABI and FFI for Machine Learning Systems

C++ · 258 stars · 43 forks · Updated Dec 23, 2025

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python · 2,238 stars · 190 forks · Updated Dec 23, 2025

ManimCommunity / manim

A community-maintained Python framework for creating mathematical animations.

Python · 36,111 stars · 2,579 forks · Updated Dec 22, 2025

0xD0GF00D / DocumentSASS

Unofficial description of the CUDA assembly (SASS) instruction sets.

Python · 183 stars · 18 forks · Updated Jul 18, 2025

QianyanTech / NBAssembler

Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.

Python · 95 stars · 10 forks · Updated Feb 23, 2023

PAA-NCIC / PPoPP2017_artifact

Third party assembler and GEMM library for NVIDIA Kepler GPU

CSS · 85 stars · 21 forks · Updated Oct 8, 2019

kuterd / nv_isa_solver

Nvidia Instruction Set Specification Generator

Python · 305 stars · 17 forks · Updated Jul 9, 2024

NVIDIA / TileGym

Helpful kernel tutorials and examples for tile-based GPU programming

Python · 475 stars · 26 forks · Updated Dec 23, 2025

NVIDIA / cutile-python

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python · 1,659 stars · 86 forks · Updated Dec 20, 2025

pytorch / gloo

Collective communications library with various primitives for multi-machine training.

C++ · 1,380 stars · 340 forks · Updated Dec 2, 2025

sansan0 / TrendRadar

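🎯 Say goodbye to information overload: AI helps you make sense of trending news, with simple public-opinion monitoring and analysis. Multi-platform trending-topic aggregation plus an MCP-based AI analysis tool. Monitors 35 platforms (Douyin, Zhihu, Bilibili, Wallstreetcn, Cailian Press, and others), with smart filtering, automatic push, and AI conversational analysis (13 tools for mining news in natural language, including trend tracking, sentiment analysis, and similarity search). Supports push via WeCom, personal WeChat, Feishu, DingTalk, Telegram, email, ntfy, bark, and Slack; phone notifications within 1 minute, no…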

Python · 40,200 stars · 20,837 forks · Updated Dec 23, 2025

rapidsai / cuvs

cuVS - a library for vector search and clustering on the GPU

Cuda · 598 stars · 150 forks · Updated Dec 23, 2025

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python · 5,974 stars · 454 forks · Updated Dec 23, 2025

cloudcores / CuAssembler

An unofficial cuda assembler, for all generations of SASS, hopefully ：）

Python · 562 stars · 96 forks · Updated Apr 20, 2023

CherryHQ / cherry-studio

🍒 Cherry Studio is a desktop client that supports multiple LLM providers.

TypeScript · 36,877 stars · 3,388 forks · Updated Dec 23, 2025

NVIDIA / nccl

Optimized primitives for collective multi-GPU communication

C++ · 4,327 stars · 1,096 forks · Updated Dec 2, 2025

ademeure / cuda-side-boost

Cuda · 52 stars · 4 forks · Updated May 5, 2025

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python · 21,262 stars · 2,244 forks · Updated Dec 23, 2025

cfregly / ai-performance-engineering

Python · 832 stars · 112 forks · Updated Dec 23, 2025

modular / modular

The Modular Platform (includes MAX & Mojo)

Mojo · 25,377 stars · 2,745 forks · Updated Dec 23, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda · 8,827 stars · 1,036 forks · Updated Dec 23, 2025

RRZE-HPC / gpu-benches

collection of benchmarks to measure basic GPU capabilities

C++ · 476 stars · 72 forks · Updated Oct 24, 2025

Dao-AILab / quack

A Quirky Assortment of CuTe Kernels

Python · 714 stars · 64 forks · Updated Dec 23, 2025

typst / typst

A markup-based typesetting system that is powerful and easy to learn.

Rust · 49,810 stars · 1,375 forks · Updated Dec 22, 2025

lewish / asciiflow

ASCIIFlow

TypeScript · 5,420 stars · 396 forks · Updated Oct 27, 2024

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python · 7,423 stars · 637 forks · Updated Dec 23, 2025

microsoft / T-MAC

Low-bit LLM inference on CPU/NPU with lookup table

C++ · 902 stars · 74 forks · Updated Jun 5, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda · 4,341 stars · 614 forks · Updated Dec 23, 2025

anilshanbhag / gpu-topk

Efficient Top-K implementation on the GPU

Cuda · 191 stars · 24 forks · Updated Apr 9, 2019

hamvocke / dotfiles

A collection of my personal dotfiles

Lua · 637 stars · 96 forks · Updated Dec 17, 2025

Starred topics

Machine learning

Linux