- Boston, MA
- http://www.gagandeepkang.com/
Stars
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
📚 A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc. 🎉
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
Useful zero-knowledge (ZK) problems constructed from real-world scenarios.
Docker configuration for running Aquarium in a local (non-deployment) setup.
Summer 2026 software engineering, data science, AI, quant, product management, and hardware internship postings. Updated daily by Simplify and Pitt CSC.
JavaScript library for building web-based applications that employ secure multi-party computation (MPC).