Skip to content
View wuyuqiang's full-sized avatar

Block or report wuyuqiang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Parent of *-lite repositories, a migration path to post-C++98 library features via polyfills.

90 5 Updated Oct 7, 2025

Fast Static Symbol Table (FSST): efficient random-access string compression

C++ 499 50 Updated Nov 26, 2025

VictoriaMetrics: fast, cost-effective monitoring solution and time series database

Go 16,333 1,571 Updated Feb 17, 2026

Performance-portable, length-agnostic SIMD with runtime dispatch

C++ 5,333 403 Updated Feb 9, 2026

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

Rust 1,701 381 Updated Feb 18, 2026

Next-Gen Big Data File Format

C++ 658 35 Updated Oct 11, 2025

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 22,019 2,177 Updated Feb 18, 2026

对各类图书资源的收集。大量计算机、AI方面书籍。

1,901 387 Updated May 29, 2023

An extremely fast Python package and project manager, written in Rust.

Rust 79,383 2,574 Updated Feb 18, 2026

本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。

Jupyter Notebook 19,313 5,425 Updated Oct 14, 2021

An easy to use PyTorch to TensorRT converter

Python 4,855 697 Updated Aug 17, 2024

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,699 2,315 Updated Feb 13, 2026

mimalloc is a compact general purpose allocator with excellent performance.

C 12,491 1,054 Updated Feb 6, 2026

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 6,569 210 Updated Feb 13, 2026

CUDA Core Compute Libraries

C++ 2,173 343 Updated Feb 18, 2026

A modern, C++-native, test framework for unit-tests, TDD and BDD - using C++14, C++17 and later (C++11 support is in v2.x branch, and C++03 on the Catch1.x branch)

C++ 20,196 3,198 Updated Feb 17, 2026

[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl

C++ 4,998 759 Updated Feb 8, 2024

A family of header-only, very fast and memory-friendly hashmap and btree containers.

C++ 3,152 304 Updated Dec 6, 2025

The central registry of Bazel modules for the Bzlmod external dependency system.

Starlark 355 670 Updated Feb 18, 2026

A type safe SQL template library for C++

C++ 2,609 355 Updated Dec 13, 2025

A Template Engine for Modern C++

C++ 1,909 241 Updated Jan 9, 2026

[SIGMOD 2024] RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search

C++ 177 29 Updated Jun 5, 2025

[SIGMOD 2025] Practical and Asymptotically Optimal Quantization of High-Dimensional Vectors in Euclidean Space for Approximate Nearest Neighbor Search

C++ 61 14 Updated Jun 4, 2025

🦆 A curated list of awesome DuckDB resources

2,287 169 Updated Feb 4, 2026

Examples from Programming in Parallel with CUDA

Cuda 170 62 Updated Feb 5, 2026

A fast multi-producer, multi-consumer lock-free concurrent queue for C++11

C++ 12,059 1,896 Updated Feb 14, 2026

A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search sc…

C++ 4,977 608 Updated Feb 11, 2026

Build rules for interfacing with "foreign" (non-Bazel) build systems (CMake, configure-make, GNU Make, boost, ninja, Meson)

Starlark 727 262 Updated Feb 6, 2026

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 982 225 Updated Feb 18, 2026

Efficient binary-decimal and decimal-binary conversion routines for IEEE doubles.

C++ 1,183 302 Updated Feb 2, 2026
Next