-
dvattn Public
Dynamic View Attention
-
radix-turn-aware-nano-vllm Public
Radix Tree KV Cache with Turn-Aware Growth
-
llama-cpp-python Public
Forked from abetlen/llama-cpp-python
Python bindings for llama.cpp
-
llama-cpp-python-cuBLAS-wheels Public
Forked from jllllll/llama-cpp-python-cuBLAS-wheels
Wheels for llama-cpp-python compiled with cuBLAS support
-
llama.cpp Public
Forked from ggml-org/llama.cpp
Port of Facebook's LLaMA model in C/C++
-
alpaca_eval Public
Forked from tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Jupyter Notebook, Apache License 2.0, Updated Oct 23, 2023