-
llama2.c Public
Forked from karpathy/llama2.cInference Llama 2 in one file of pure C
C MIT License UpdatedMay 11, 2024 -
llama.cpp Public
Forked from ggml-org/llama.cppLLM inference in C/C++
C++ MIT License UpdatedApr 17, 2024 -
-
mlc-lcm Public
implement LCM(Latent Consistency Model) via tvm, then use it in Android, all of this is for work.
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedNov 3, 2023 -
-
MNN Public
Forked from alibaba/MNNMNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
C++ UpdatedFeb 28, 2023 -
-
competition_compute Public
hom many situations TES will meet.
Python MIT License UpdatedOct 13, 2022 -
trt-samples-for-hackathon-cn Public
Forked from NVIDIA/trt-samples-for-hackathon-cnSimple samples for TensorRT programming
Jupyter Notebook Apache License 2.0 UpdatedJul 27, 2022 -
TensorRT Public
Forked from NVIDIA/TensorRTTensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.
C++ Apache License 2.0 UpdatedApr 4, 2022