University of Science and Technology of China
Hefei, Anhui (UTC +08:00)
https://github.com/guaguastandup
Starred repositories
Godot Engine – Multi-platform 2D and 3D game engine
An open-source C++ library developed and used at Facebook.
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
WebRTC/RTSP/RTMP/HTTP/HLS/HTTP-FLV/WebSocket-FLV/HTTP-TS/HTTP-fMP4/WebSocket-TS/WebSocket-fMP4/GB28181/SRT/STUN/TURN server and client framework based on C++11
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
FlashMLA: Efficient Multi-head Latent Attention Kernels
WasmEdge is a lightweight, high-performance, and extensible WebAssembly runtime for cloud native, edge, and decentralized applications. It powers serverless apps, embedded functions, microservices,…
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
High-speed Large Language Model Serving for Local Deployment
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
One stop solution for all Vulkan samples
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
DSi Menu replacement for DS/DSi/3DS/2DS
A tutorial on writing a game engine from scratch
Restore a truncated mp4/mov. Improved version of ponchio/untrunc
Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
Restore a damaged (truncated) mp4, m4v, mov, 3gp video. Provided you have a similar not broken video.
Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training
Pluggable in-process caching engine to build and scale high performance services
A fast communication-overlapping library for tensor/expert parallelism on GPUs.