Stars
Stable Diffusion web UI
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Robust Speech Recognition via Large-Scale Weak Supervision
A high-throughput and memory-efficient inference and serving engine for LLMs
The definitive Web UI for local AI, with powerful features and easy setup.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
WebUI extension for ControlNet
Lets make video diffusion practical!
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
An open source implementation of CLIP.
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Hackable and optimized Transformers building blocks, supporting a composable construction.
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
a state-of-the-art-level open visual language model | 多模态预训练模型
Model parallel transformers in JAX and Haiku