Skip to content
View TongLi3701's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report TongLi3701

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
29 stars written in C++
Clear filter

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 77,376 8,335 Updated May 27, 2025

Caffe: a fast open framework for deep learning.

C++ 34,622 18,504 Updated Jul 31, 2024

Productive, portable, and performant GPU programming in Python.

C++ 28,177 2,381 Updated Apr 6, 2026

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 23,871 5,988 Updated Apr 30, 2026

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 23,175 4,419 Updated Apr 22, 2026

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.

C++ 15,067 2,299 Updated Apr 28, 2026

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,949 2,352 Updated Apr 13, 2026

Achieve a tiny STL in C++11

C++ 12,433 3,312 Updated Oct 27, 2024

A distributed, fast open-source graph database featuring horizontal scalability and high availability

C++ 12,154 1,302 Updated Oct 22, 2025

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 11,795 1,338 Updated Apr 26, 2026

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 9,391 1,015 Updated Dec 4, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,240 719 Updated Apr 30, 2026

MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.

C++ 4,690 755 Updated Jul 29, 2024

fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tps,多并发可达60+。

C++ 4,217 423 Updated Apr 30, 2026

高性能并行编程与优化 - 课件

C++ 4,181 560 Updated Oct 18, 2024

oneAPI Deep Neural Network Library (oneDNN)

C++ 3,984 1,125 Updated Apr 30, 2026

校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

C++ 3,415 363 Updated Jun 22, 2025

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,300 333 Updated May 16, 2023

Tutorial code on how to build your own Deep Learning System in 2k Lines

C++ 2,015 365 Updated Oct 4, 2018

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,875 250 Updated Apr 29, 2026

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,829 197 Updated Mar 17, 2026

OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference.

C++ 1,681 327 Updated Apr 14, 2026

a lightweight LLM model inference framework

C++ 752 95 Updated Apr 7, 2024

MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器

C++ 483 56 Updated Oct 23, 2024

Distributed LR、 FM model on Parameter Server. FTRL and SGD Optimization Algorithm.

C++ 223 82 Updated Mar 14, 2018

Caffe implementation for dynamic network surgery.

C++ 190 71 Updated Aug 15, 2017

High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph

C++ 55 7 Updated Jul 3, 2022

MIT 6.824 Lab 2012(C++)

C++ 30 6 Updated Mar 13, 2013