Stars
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Running large language models on a single GPU for throughput-oriented scenarios.
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
all kinds of text classification models and more with deep learning
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.
Web interface for browsing, search and filtering recent arxiv submissions
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
Official implementation of "Neuralangelo: High-Fidelity Neural Surface Reconstruction" (CVPR 2023)
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun
A unified, comprehensive and efficient recommendation library
Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement
Vector (and Scalar) Quantization, in Pytorch
RepVGG: Making VGG-style ConvNets Great Again
An autoregressive character-level language model for making more things
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.
Training and serving large-scale neural networks with auto parallelization.
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/