Skip to content
View thynics's full-sized avatar

Block or report thynics

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LLM inference in C/C++

C++ 98,981 15,708 Updated Mar 23, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 73,970 14,635 Updated Mar 22, 2026

A post-modern modal text editor.

Rust 43,597 3,370 Updated Mar 20, 2026

程序员延寿指南 | A programmer's guide to live longer

34,992 2,380 Updated May 19, 2025

Everything you need to move your project faster

Rust 21,155 840 Updated Mar 18, 2026

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 16,493 2,342 Updated Sep 3, 2025

CUDA on non-NVIDIA GPUs

Rust 14,034 900 Updated Mar 23, 2026

本人的科研经验

10,867 560 Updated Mar 7, 2026

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,974 993 Updated Mar 20, 2026

Material for gpu-mode lectures

Jupyter Notebook 5,869 587 Updated Feb 1, 2026

😏国内外计算机的优秀课程,包含MIT、CMU等世界CS名校,🔥🔥其中包含计算机基础学科(操作系统、计算机网络、编译器、数据库、数据结构与算法等)以及人工智能&AI等高级科目,欢迎通过PR形式贡献!

1,754 194 Updated Apr 18, 2023

[TMLR 2024] Efficient Large Language Models: A Survey

1,256 98 Updated Jun 23, 2025

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…

Cuda 1,255 177 Updated Jul 29, 2023

My curriculum vitae (CV) written using LaTeX.

TeX 908 275 Updated Sep 11, 2024

List of papers related to neural network quantization in recent AI conferences and journals.

812 61 Updated Mar 27, 2025

hpc-learning

783 45 Updated May 30, 2024

欧港新CS留学项目指北

HTML 773 62 Updated Aug 25, 2025

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Python 411 37 Updated Aug 13, 2024

[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Python 362 44 Updated Nov 20, 2025

[ICML 2024] BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

Python 229 18 Updated Jan 11, 2025

A parallel programming training mini app simulating weather-like flows

C++ 178 82 Updated Aug 11, 2025

Wiki fo HPC

Python 137 13 Updated Jul 23, 2025

BGHT: High-performance static GPU hash tables.

C++ 72 9 Updated Jul 2, 2025

zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation

C 29 18 Updated May 10, 2021

Scalable radix top-k selection on GPUs.

Cuda 21 3 Updated Jan 27, 2025

a Model-Free GPU Online Energy Optimization (MF-GPOEO) framework

C++ 4 2 Updated Dec 11, 2023

A quick survival guild for i18n students who comes to chalmers.

SCSS 4 2 Updated Nov 18, 2023

XiTAO is a lightweight layer built on top of modern C++ features with the goals of being low-overhead and serving as a development platform for testing scheduling and resource management algorithms.

C++ 2 1 Updated Jun 2, 2021