ForestP's 25 stars written in Python

Robust Speech Recognition via Large-Scale Weak Supervision

Python 94,176 11,719 Updated Dec 15, 2025

Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.

Python 22,685 2,405 Updated Apr 29, 2025

WebUI extension for ControlNet

Python 17,877 2,031 Updated Aug 12, 2024

Stable Diffusion with Core ML on Apple Silicon

Python 17,786 1,047 Updated Jul 3, 2025

Go ahead and axolotl questions

Python 11,229 1,248 Updated Feb 4, 2026

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,293 1,003 Updated Jul 1, 2024

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Python 9,918 591 Updated Sep 7, 2024
Python 8,671 520 Updated Oct 9, 2024

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 6,504 732 Updated Mar 19, 2025

Production-ready implementation of InvisPose - a revolutionary WiFi-based dense human pose estimation system that enables real-time full-body tracking through walls using commodity mesh routers

Python 5,598 493 Updated Jan 14, 2026

A language for constraint-guided and efficient LLM programming.

Python 4,143 218 Updated May 22, 2025
Python 3,887 255 Updated Mar 15, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 3,275 201 Updated Oct 31, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Python 1,894 499 Updated Jun 8, 2023

[ICCV 2023] DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders

Python 1,432 155 Updated Jan 17, 2026

Strong and Open Vision Language Assistant for Mobile Devices

Python 1,329 86 Updated Apr 15, 2024

[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

Python 1,326 103 Updated Apr 25, 2024

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 822 107 Updated Feb 3, 2025

[CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

Python 769 43 Updated Aug 14, 2024

[CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models

Python 280 16 Updated Apr 17, 2024

Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"

Python 269 25 Updated Dec 10, 2024

GitHub repository for the paper 'Personalized Restoration via Dual-Pivot Tuning'.

Python 138 2 Updated Dec 12, 2024

Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"

Python 5 Updated Jan 9, 2024