Skip to content
View zyds's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report zyds

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

13 stars written in C++
Clear filter

An Open Source Machine Learning Framework for Everyone

C++ 192,408 74,983 Updated Nov 12, 2025

A library for efficient similarity search and clustering of dense vectors.

C++ 37,895 4,108 Updated Nov 11, 2025

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 23,387 5,873 Updated Nov 12, 2025

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

C++ 20,831 6,752 Updated Oct 25, 2023

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 18,356 3,539 Updated Nov 12, 2025

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …

C++ 17,830 3,960 Updated Nov 11, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,352 2,272 Updated Nov 8, 2025

Transformer related optimization, including BERT, GPT

C++ 6,345 920 Updated Mar 27, 2024

fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tps,多并发可达60+。

C++ 4,067 413 Updated Oct 28, 2025

该仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记

C++ 4,025 648 Updated Aug 18, 2023

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,299 333 Updated May 16, 2023

ONNX-TensorRT: TensorRT backend for ONNX

C++ 3,166 544 Updated Nov 6, 2025

A scalable inference server for models optimized with OpenVINO™

C++ 789 232 Updated Nov 10, 2025