Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Command-line program to download videos from YouTube.com and other video sites
Tensors and Dynamic neural networks in Python with strong GPU acceleration
FastAPI framework, high performance, easy to learn, fast to code, ready for production
A high-throughput and memory-efficient inference and serving engine for LLMs
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Streamlit — A faster way to build and share data apps.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Federated query engine for AI - The only MCP Server you'll ever need
A toolkit for developing and comparing reinforcement learning algorithms.
💫 Industrial-strength Natural Language Processing (NLP) in Python
Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,…
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
Deezer source separation library including pretrained models.
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Google Chromium, sans integration with Google
Code for the paper "Language Models are Unsupervised Multitask Learners"
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integra…
Tornado is a Python web framework and asynchronous networking library, originally developed at FriendFeed.