Stars
π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Command-line program to download videos from YouTube.com and other video sites
Tensors and Dynamic neural networks in Python with strong GPU acceleration
FastAPI framework, high performance, easy to learn, fast to code, ready for production
A high-throughput and memory-efficient inference and serving engine for LLMs
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
YOLOv5 π in PyTorch > ONNX > CoreML > TFLite
The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Streamlit β A faster way to build and share data apps.
TensorFlow code and pre-trained models for BERT
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Federated query engine for AI - The only MCP Server you'll ever need
A toolkit for developing and comparing reinforcement learning algorithms.
π« Industrial-strength Natural Language Processing (NLP) in Python
Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,β¦
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
Deezer source separation library including pretrained models.
π Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Google Chromium, sans integration with Google
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source β¦
Code for the paper "Language Models are Unsupervised Multitask Learners"
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training