Lists (1)
Sort Name ascending (A-Z)
Stars
7
stars
written in C++
Clear filter
An Open Source Machine Learning Framework for Everyone
FlashMLA: Efficient Multi-head Latent Attention Kernels
High-speed Large Language Model Serving for Local Deployment
Diffusion model(SD,Flux,Wan,Qwen Image,...) inference in pure C/C++