moe

Here are 7 public repositories matching this topic...

microsoft / Tutel

Tutel MoE: Optimized Mixture-of-Experts Library, Support GptOss/DeepSeek/Kimi-K2/Qwen3 using FP8/NVFP4/MXFP4

pytorch moe mixture-of-experts llm deepseek

Updated Jun 4, 2026
C

ianhom / MOE

MOE is an event-driven OS for 8/16/32-bit MCUs. MOE means "Minds Of Embedded system". It’s also the name of my lovely baby daughter 😎

schedule event-driven moe easy-to-use mcu protothreads multi-task

Updated Dec 8, 2019
C

artalis-io / bitnet.c

Minimal, zero-dependency LLM inference in pure C11. CPU-first with NEON/AVX2 SIMD. Flash MoE (pread + LRU expert cache). TurboQuant 3-bit KV compression (8.9x less memory per session). 20+ GGUF quant formats. Compiles to WASM.

c neon wasm inference simd moe avx2 quantization kv-cache cpu-inference llm gguf turboquant

Updated Jun 9, 2026
C

whyakari / android_kernel_xiaomi_ginkgo_old

MoeKernel source for Xiaomi Redmi Note 8/8T. Source moved to https://github.com/MoeKernel/android_kernel_xiaomi_ginkgo

android kernel moe ginkgo willow moekernel

Updated Mar 22, 2024
C

AHX47 / flash-moe-universal

Cross‑platform inference engine for huge AI models (1B–397B). Runs on any CPU (x86_64/ARM64) with AVX2/NEON, supports dense & MoE models (Qwen, Llama, Mistral…). GPU backends (Metal, OpenCL, CUDA) coming soon. No Python, no frameworks – pure C with optional PyQt5 GUI.

metal neon opencl x86-64 cuda moe avx2 arm64 pyqt5-desktop-application tui-app apple-silicon qwen ai-local cpu-reference ahx47

Updated Jun 2, 2026
C

AstrolexisAI / MnemoCUDA

Expert streaming inference engine for MoE models larger than VRAM — run 235B+ models on consumer GPUs

gpu cuda inference moe quantization nvme vram mixture-of-experts llm expert-streaming

Updated Mar 30, 2026
C

shifulegend / project-zero

CPU-optimized LLM inference engine (C)

cpu c99 inference moe avx512 ternary bitnet llm gguf deepseek

Updated Jun 7, 2026
C
