Stars
The new Windows Terminal and the original Windows console host, all in the same place!
A library for efficient similarity search and clustering of dense vectors.
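PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)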
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
Master C++ :punch:. A study repository for C++ Primer (Chinese edition, 5th edition), including notes and answers to the end-of-chapter exercises.
Transformer-related optimization, including BERT and GPT
A flexible, high-performance serving system for machine learning models
🏋️ Python / Modern C++ Solutions of All 3735 LeetCode Problems (Weekly Update)
Header-only C++/python library for fast approximate nearest neighbors
A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search sc…
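A great project for campus, fall, and spring recruiting and for internships! Implement a high-performance deep learning inference library from scratch, step by step, with support for inference on models such as the large model llama2, Unet, Yolov5, and Resnet.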
Visual Python and C++ nanosecond profiler, logger, tests enabler
C++11/14/17/20 Concurrency Demystified: From Core Principles to Thread-Safe Code
A fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc.) on CPU and GPU.
Examples for using ONNX Runtime for machine learning inferencing.
An Easy-to-Use and High-Performance AI Deployment Framework
Making text a first-class citizen in TensorFlow.
OpenDHT: a C++17 Distributed Hash Table implementation
A flexible, high-performance carrier for machine learning models (PaddlePaddle serving deployment framework)
Fork of TensorFlow accelerated by DirectML