
tiny vision language model

Python 9,765 779 Updated Apr 20, 2026

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 8,123 614 Updated Jul 17, 2024

Unofficial implementation of InstantID for ComfyUI

Python 1,445 80 Updated May 22, 2024

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,953 884 Updated Jul 18, 2024
Python 1 Updated Jan 5, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 36,717 4,099 Updated Apr 19, 2025

LLM inference in C/C++

C++ 116,652 19,601 Updated Jun 15, 2026

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,795 1,173 Updated Apr 8, 2026

XTTSv2 Extension for oobabooga text-generation-webui

Python 157 18 Updated Nov 21, 2023

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 45,563 6,115 Updated Aug 16, 2024

Industry leading face manipulation platform

Python 28,797 4,695 Updated Jun 15, 2026

A framework to enable a multimodal model to operate a computer.

Python 10,252 1,424 Updated Sep 19, 2025

ComfyUI related stuff and things

1,315 90 Updated Apr 5, 2025

Interact privately with your documents using the power of GPT, 100% privately, no data leaks

Python 4 2 Updated Jun 7, 2023

Google's SoundStorm: Efficient Parallel Audio Generation

Python 131 13 Updated Aug 8, 2023

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Python 1,545 94 Updated Apr 24, 2025

Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA

Python 124 6 Updated Jun 16, 2023

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,397 3,280 Updated Aug 17, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 5,067 543 Updated Apr 11, 2025

Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.

Python 22,218 2,484 Updated Mar 10, 2026

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 4,927 501 Updated Jun 13, 2026

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 57,273 7,611 Updated Jun 15, 2026

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,824 3,420 Updated May 18, 2024

Code for GAN2Shape (ICLR2021 oral)

Python 574 99 Updated Jul 4, 2023

FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…

JavaScript 2,719 365 Updated Jun 14, 2026

BabyAGI to run with GPT4All

Python 247 33 Updated May 14, 2023

Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.

Python 47,316 5,977 Updated Jun 2, 2026

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Jupyter Notebook 3,341 442 Updated Aug 24, 2025

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 39,160 4,682 Updated Aug 19, 2024