-
mobilekv Public
MobileKV: A lightweight KV cache runtime for on-device LLM inference
C++ Apache License 2.0 UpdatedMar 24, 2026 -
-
mediapipe_cmake Public
Try to reproduce mediapipe with OpenCV_lite and MNN.
-
-
-
-
SageAttention-int4 Public
Forked from thu-ml/SageAttention[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
Cuda Apache License 2.0 UpdatedMar 6, 2026 -
opencv_lite Public
Lightweight OpenCV-style API with pluggable AI inference backends (TensorFlow Lite, ONNX Runtime, MNN) for edge and mobile vision.
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJan 26, 2026 -
-
MNN Public
Forked from alibaba/MNNMNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
-
ComfyUI-WanVideoWrapper Public
Forked from kijai/ComfyUI-WanVideoWrapperPython Apache License 2.0 UpdatedOct 23, 2025 -
-
-
triton-dev Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
MLIR MIT License UpdatedJun 27, 2025 -
pytorch_comment Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedApr 27, 2025 -
oneflow_comment Public
Forked from Oneflow-Inc/oneflowOneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
C++ Apache License 2.0 UpdatedApr 10, 2025 -
DeepGEMM Public
Forked from deepseek-ai/DeepGEMMDeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Cuda MIT License UpdatedFeb 28, 2025 -
-
resgait Public
The benchmark experiments of paper "ReSGait: The real scene gait dataset".
-
opencv Public
Forked from opencv/opencvOpen Source Computer Vision Library
-
lmdeploy Public
Forked from InternLM/lmdeployLMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Python Apache License 2.0 UpdatedJun 28, 2024 -
-
llama.cpp Public
Forked from ggml-org/llama.cppPort of Facebook's LLaMA model in C/C++
C MIT License UpdatedApr 3, 2024 -
tflite_cmake Public
Forked from tensorflow/tensorflowcompile tflite with cmake
C++ Apache License 2.0 UpdatedMar 21, 2024 -
-
-
mediapipe_commont Public
Forked from google-ai-edge/mediapipeCross-platform, customizable ML solutions for live and streaming media.
C++ Apache License 2.0 UpdatedDec 12, 2023 -
ggml Public
Forked from ggml-org/ggmlTensor library for machine learning
C MIT License UpdatedNov 16, 2023 -
llama2.c Public
Forked from karpathy/llama2.cInference Llama 2 in one file of pure C
C MIT License UpdatedNov 14, 2023