Stars
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
Bringing Characters to Life with Computer Brains in Unity
Transformer related optimization, including BERT, GPT
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
Nearest Neighbor Search with Neighborhood Graph and Tree for High-dimensional Data
Implementations of CFR for solving a variety of Holdem-like poker games