SDE @ Amazon Redshift | EX-SDE Intern @ Amazon Redshift & AWS Startups & Aviatrix | Computer Engineering @ UIUC
-
University Of Illinois at Urbana-Champaign
- in/yihong-jin-a11586195
Highlights
- Pro
Stars
4
stars
written in C++
Clear filter
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
High-speed Large Language Model Serving for Local Deployment
Apache Traffic Server™ is a fast, scalable and extensible HTTP/1.1 and HTTP/2 compliant caching proxy server.