yinze00
🎯 Focusing

Starred repositories

CUDA Embedding Lookup Kernel Library

Cuda 43 5 Updated Feb 9, 2026

Distributed Compiler based on Triton for Parallel Systems

Python 1,401 136 Updated Mar 11, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,307 852 Updated Mar 22, 2026

Transformer related optimization, including BERT, GPT

C++ 6,410 935 Updated Mar 27, 2024

A small utility to modify the dynamic linker and RPATH of ELF executables

C 4,174 520 Updated Dec 15, 2025

A lightweight library for portable low-level GPU computation using WebGPU.

C++ 3,960 192 Updated Oct 8, 2025

List of Computer Science courses with video lectures.

79,366 10,905 Updated Mar 27, 2026

An automatic serializer for C++ non-POD structs, based on the C++17 features structured bindings and constexpr if, plus the Boost preprocessor

C++ 37 11 Updated Oct 8, 2017

A JIT assembler for x86/x64 architectures supporting FPU, MMX, SSE (1-4), AVX (1-2, 512), APX, and AVX10.2

C++ 2,234 305 Updated Mar 30, 2026

It's just fascinating. How is modern software designed? 🤔 Some design-level considerations for scalability, maintainability, eventual consistency, availability & reliability. 👨‍💻 Interview Prep. 👨‍💻

2,162 410 Updated Feb 21, 2024

An open source, standard data file format for graph data storage and retrieval.

C++ 350 88 Updated Mar 10, 2026

A hybrid thread / fiber task scheduler written in C++ 11

C++ 1,993 199 Updated Feb 22, 2025
C++ 23 14 Updated Apr 1, 2026

A tool for controlling Dell server fan speed. It sends control instructions via ipmitool over LAN. It is a Windows GUI application built with C# WinForms.

C# 407 96 Updated Feb 28, 2024

LLM inference in C/C++

C++ 101,301 16,337 Updated Apr 4, 2026

Open Machine Learning Compiler Framework

Python 13,246 3,846 Updated Apr 4, 2026

Unified Executors

C++ 1,685 209 Updated Apr 3, 2026

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 98,796 27,397 Updated Apr 4, 2026

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Cuda 1,081 164 Updated Apr 4, 2026

Tensorflow examples written in C++

CMake 50 17 Updated Jun 20, 2018

cuVS - a library for vector search and clustering on the GPU

Cuda 726 178 Updated Apr 3, 2026

Pre-built libtensorflow_cc.so and Docker Images for TensorFlow C++ API

Dockerfile 67 9 Updated Jan 5, 2024

The Serenity Operating System 🐞

C++ 33,066 3,313 Updated Apr 3, 2026

SPARTA is a library of software components specially designed for building high-performance static analyzers based on the theory of Abstract Interpretation.

C++ 666 54 Updated Mar 15, 2026

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 990 228 Updated Apr 3, 2026

A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.

70,609 8,261 Updated Apr 3, 2026

Roaring bitmaps in C (and C++), with SIMD (AVX2, AVX-512 and NEON) optimizations: used by Apache Doris, ClickHouse, Alibaba Tair, Redpanda, YDB and StarRocks

C 1,799 312 Updated Apr 4, 2026

Cross-platform asynchronous I/O

C 26,739 3,861 Updated Apr 2, 2026

A collection of live-streaming source resources 📺 💯 IPTV, M3U. Wash your hands often, wear a mask, and may everyone stay safe from every virus

29,096 3,394 Updated Nov 14, 2025