Skip to content
View thynics's full-sized avatar

Block or report thynics

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
30 results for source starred repositories
Clear filter

A parallel programming training mini app simulating weather-like flows

C++ 173 80 Updated Aug 11, 2025

BGHT: High-performance static GPU hash tables.

C++ 71 8 Updated Jul 2, 2025

Scalable radix top-k selection on GPUs.

Cuda 21 3 Updated Jan 27, 2025

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…

Cuda 1,235 179 Updated Jul 29, 2023

[TMLR 2024] Efficient Large Language Models: A Survey

1,253 99 Updated Jun 23, 2025

LLM inference in C/C++

C++ 94,413 14,763 Updated Feb 5, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 69,535 13,202 Updated Feb 5, 2026

List of papers related to neural network quantization in recent AI conferences and journals.

793 59 Updated Mar 27, 2025

[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Python 356 39 Updated Nov 20, 2025

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Python 402 38 Updated Aug 13, 2024

[ICML 2024] BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

Python 229 17 Updated Jan 11, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,573 944 Updated Feb 4, 2026

A post-modern modal text editor.

Rust 42,777 3,301 Updated Feb 4, 2026

CUDA on non-NVIDIA GPUs

Rust 13,898 896 Updated Jan 27, 2026

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 16,235 2,328 Updated Sep 3, 2025

zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation

C 28 18 Updated May 10, 2021

a Model-Free GPU Online Energy Optimization (MF-GPOEO) framework

C++ 4 2 Updated Dec 11, 2023

XiTAO is a lightweight layer built on top of modern C++ features with the goals of being low-overhead and serving as a development platform for testing scheduling and resource management algorithms.

C++ 2 1 Updated Jun 2, 2021

Material for gpu-mode lectures

Jupyter Notebook 5,683 570 Updated Feb 1, 2026

本人的科研经验

10,118 528 Updated Jan 29, 2026

A quick survival guild for i18n students who comes to chalmers.

SCSS 4 2 Updated Nov 18, 2023

Wiki fo HPC

Python 130 12 Updated Jul 23, 2025

😏国内外计算机的优秀课程,包含MIT、CMU等世界CS名校,🔥🔥其中包含计算机基础学科(操作系统、计算机网络、编译器、数据库、数据结构与算法等)以及人工智能&AI等高级科目,欢迎通过PR形式贡献!

1,716 189 Updated Apr 18, 2023

hpc-learning

778 46 Updated May 30, 2024

My curriculum vitae (CV) written using LaTeX.

TeX 886 269 Updated Sep 11, 2024

Everything you need to move your project faster

Rust 21,166 843 Updated Feb 2, 2026

程序员延寿指南 | A programmer's guide to live longer

34,781 2,376 Updated May 19, 2025

欧港新CS留学项目指北

HTML 772 62 Updated Aug 25, 2025