Lists (2)
Sort Name ascending (A-Z)
Stars
A deep-dive on the entire history of deep-learning
A high-throughput and memory-efficient inference and serving engine for LLMs
Official inference framework for 1-bit LLMs
Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild
Gin is a high-performance HTTP web framework written in Go. It provides a Martini-like API but with significantly better performance—up to 40 times faster—thanks to httprouter. Gin is designed for …
🔍 A Hex Editor for Reverse Engineers, Programmers and people who value their retinas when working at 3 AM.
Official inference library for pre-processing of Mistral models
You like pytorch? You like micrograd? You love tinygrad! ❤️
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
Language model alignment-focused deep learning curriculum
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
bottle.py is a fast and simple micro-framework for python web-applications.
Code for the paper "Language Models are Unsupervised Multitask Learners"
Port of OpenAI's Whisper model in C/C++
A collection of prompts, system prompts and LLM instructions
a fast cross platform AI inference engine 🤖 using Rust 🦀 and WebGPU 🎮
21 Lessons, Get Started Building with Generative AI
Get a ChatGPT plugin up and running in under 5 minutes!
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.