Lists (2)
Sort Name ascending (A-Z)
Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
A high-throughput and memory-efficient inference and serving engine for LLMs
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
You like pytorch? You like micrograd? You love tinygrad! ❤️
Data validation using Python type hints
Code for the paper "Language Models are Unsupervised Multitask Learners"
Official inference framework for 1-bit LLMs
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
The official GitHub page for the survey paper "A Survey of Large Language Models".
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
bottle.py is a fast and simple micro-framework for python web-applications.
Get a ChatGPT plugin up and running in under 5 minutes!
High level asynchronous concurrency and networking framework that works on top of either Trio or asyncio
Reading list for ramping up with professional Python
Official inference library for pre-processing of Mistral models