Reliable and Efficient Semantic Prompt Caching with vCache
machine-learning consistency cache chatbot openai llama gpt verified memcache correctness semantic-search similarity-search rag vector-search guarantees vector-database llm vllm semantic-cache llm-memory
-
Updated
Dec 17, 2025 - Python