Stars
TensorRT and Triton Inference
NVIDIA Inference Optimizations
AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solution.
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
A Datacenter Scale Distributed Inference Serving Framework
Ongoing research training transformer models at scale
A high-throughput and memory-efficient inference and serving engine for LLMs
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
scikit-learn: machine learning in Python
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
The fundamental package for scientific computing with Python.
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
Master programming by recreating your favorite technologies from scratch.
The official Python library for the HelpingAI API
FastAPI framework, high performance, easy to learn, fast to code, ready for production
A collection of free OpenAI keys to use in your projects
All of the n8n workflows I could find (including those from the site itself)
Jobs scraper library for LinkedIn, Indeed, Glassdoor, Google, ZipRecruiter & more
A repository of paper/architecture replications of classic and SOTA AI/ML papers in PyTorch