Stars
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Robust Speech Recognition via Large-Scale Weak Supervision
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
TensorFlow code and pre-trained models for BERT
A toolkit for developing and comparing reinforcement learning algorithms.
The official Python library for the OpenAI API
SGLang is a high-performance serving framework for large language models and multimodal models.
Code for the paper "Language Models are Unsupervised Multitask Learners"
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
An educational resource to help anyone learn deep reinforcement learning.
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
AlphaFold 3 inference pipeline.
Code for the paper "Evaluating Large Language Models Trained on Code"
Post-training with Tinker
Code for the paper Fine-Tuning Language Models from Human Preferences
Self-contained, minimalistic implementation of diffusion models with Pytorch.
Code for "Learning to summarize from human feedback"
Code for the paper "Exploration by Random Network Distillation"
A suite of test scenarios for multi-agent reinforcement learning.
DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (DQN) agent.
Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation