Stars
Robust Speech Recognition via Large-Scale Weak Supervision
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
WebUI extension for ControlNet
Stable Diffusion with Core ML on Apple Silicon
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Production-ready implementation of InvisPose - a revolutionary WiFi-based dense human pose estimation system that enables real-time full-body tracking through walls using commodity mesh routers
A language for constraint-guided and efficient LLM programming.
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
ShivamShrirao / diffusers
Forked from huggingface/diffusers🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
[ICCV 2023] DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders
Strong and Open Vision Language Assistant for Mobile Devices
[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
[CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
[CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models
Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"
GitHub repository for the paper 'Personalized Restoration via Dual-Pivot Tuning'.
lucataco / AnyDoor
Forked from ali-vilab/AnyDoorOfficial implementations for paper: Anydoor: zero-shot object-level image customization