Starred repositories

Rust Code Review Guidelines (RCRG)

29 Updated Oct 5, 2023

Blazingly fast LLM inference.

Rust 6,293 496 Updated Dec 19, 2025

Multilevel Optimized Matrix-matrix Multiplication Sandbox

C 8 1 Updated Jul 18, 2019

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,845 327 Updated Nov 28, 2025

A Hardware Description Language based on the Rust Programming Language

Verilog 258 22 Updated Dec 18, 2025

An attempt at achieving the theoretical best memory bandwidth of my machine.

C 53 20 Updated May 19, 2013

C++11/14/17/20 Concurrency Demystified: From Core Principles to Thread-Safe Code

C++ 2,185 347 Updated Dec 25, 2024

A handy ECS

Rust 1,188 93 Updated Dec 17, 2025

An introduction to Intel AVX-512

C 54 6 Updated Nov 14, 2025

An APL-like programming language

BQN 1,022 66 Updated Dec 13, 2025

🚀 Beat AI Briefing: ongoing coverage of key developments in AI to help you master it. Just beat it! Stars and subscriptions welcome.

Handlebars 4,485 242 Updated Dec 4, 2025

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

11,695 1,923 Updated Aug 31, 2023

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

C++ 1,593 357 Updated Dec 10, 2025

A tiny library for coding with large language models.

Python 1,237 76 Updated Jul 10, 2024

A Python library that transfers PyTorch tensors between CPU and NVMe

C++ 123 27 Updated Nov 27, 2024

An automatic differentiation tool written in 200 lines

Python 52 12 Updated Nov 7, 2019

A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024)

Python 424 31 Updated Oct 25, 2025

Scalable Hashing on Persistent Memory

C++ 193 26 Updated Apr 16, 2024

Cost/performance analysis of index structures on SSD and persistent memory (CIDR 2022)

C++ 36 1 Updated Jun 23, 2022

DINOMO: An Elastic, Scalable, High-Performance Key-Value Store for Disaggregated Persistent Memory (PVLDB 2022, VLDB 2023)

Python 37 4 Updated Apr 21, 2023

RECIPE : high-performance, concurrent indexes for persistent memory (SOSP 2019)

C++ 194 46 Updated Oct 15, 2024

Persistent Memory Development Kit

C 1,391 509 Updated Nov 12, 2025

LLMPerf is a library for validating and benchmarking LLMs

Python 1,067 198 Updated Dec 9, 2024

High-performance, lock-free local and concurrent object memory pool with automated allocation, cleanup, and verification.

Rust 38 6 Updated Dec 16, 2025

Prompt writing patterns: how to give a machine a framework for thinking, approaching prompts as design patterns

3,084 197 Updated Mar 22, 2023

AISystem mainly covers AI systems: full-stack low-level AI technologies including AI chips, AI compilers, and AI inference and training frameworks

Jupyter Notebook 15,874 2,274 Updated Sep 3, 2025

Fastest vector database made in NumPy

Python 757 39 Updated Oct 9, 2025

Ongoing research training transformer models at scale

Python 14,642 3,397 Updated Dec 19, 2025

Efficient argmin & argmax

Rust 65 9 Updated Oct 12, 2025

A fast llama2 decoder in pure Rust.

Rust 1,056 58 Updated Nov 30, 2023