Skip to content
View fenglui's full-sized avatar

Block or report fenglui

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

57 stars written in C++
Clear filter

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)

C++ 2,969 333 Updated Jul 31, 2024

⚠️DirectML is in maintenance mode ⚠️ DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. DirectML provides GPU acceleration for common machine learning tas…

C++ 2,523 327 Updated Sep 23, 2025

huatuo是一个特性完整、零成本、高性能、低内存的近乎完美的Unity全平台原生c#热更方案。 Huatuo is a fully featured, zero-cost, high-performance, low-memory solution for Unity's all-platform native c# hotfix

C++ 2,348 382 Updated Feb 16, 2023

ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.

C++ 2,090 151 Updated Nov 7, 2025

TinyML AI inference library

C++ 1,871 239 Updated May 10, 2025

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

C++ 1,535 206 Updated Jul 18, 2025

Explore fractals in an audio-visual sandbox

C++ 1,226 145 Updated Dec 3, 2021

DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.

C++ 1,144 357 Updated Jan 21, 2025

Samples and Tools for Windows ML.

C++ 1,107 447 Updated Aug 7, 2025

The fastest RISC-V sandbox

C++ 951 75 Updated Nov 5, 2025

gStore - a graph based RDF triple store.

C++ 823 212 Updated Dec 19, 2024

Fast BPE

C++ 678 102 Updated Jun 18, 2024

Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore, SYCL for CPU/GPU, OpenCL for AMD/NVIDIA, Android CPU/GPU backends.

C++ 469 49 Updated Apr 20, 2025

Fork of TensorFlow accelerated by DirectML

C++ 469 32 Updated Sep 25, 2024

The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.

C++ 466 62 Updated Oct 29, 2025

Intel® Video Processing Library (Intel® VPL) API, dispatcher, and examples

C++ 325 96 Updated Aug 6, 2025

DirectML PluggableDevice plugin for TensorFlow 2

C++ 194 28 Updated Feb 27, 2025

llama.cpp for Flutter

C++ 184 27 Updated Aug 17, 2025

An implementation of HIP that works on CPUs, across OSes.

C++ 127 22 Updated Mar 19, 2024

原XAPI2,清理了部分内容,仓库更简洁

C++ 88 57 Updated Dec 9, 2021

Query-Adaptive Vector Search

C++ 60 12 Updated Nov 3, 2025

Deepspeed windows information

C++ 42 2 Updated Mar 9, 2024

A program that can render images as space filling curves.

C++ 7 1 Updated Jan 29, 2023

Development has moved to SegmentLinking/cmssw

C++ 5 15 Updated Dec 9, 2024

2D gravity simulation

C++ 5 2 Updated Sep 27, 2024

Port of Facebook's LLaMA model in C/C++

C++ 4 Updated Feb 7, 2024