One-click Windows installer for Z-Image Turbo AI image generation. Optimized for low-VRAM GPUs (4GB+). Features Gradio web UI, automatic setup, and GGUF model support.
Hierarchical RAG architecture scaling to 693K chunks on consumer hardware (4GB VRAM). Features 3-address routing, hybrid vector+graph fusion, and SetFit classification.
The most efficient one-page LoRA trainer for Anima 2B. Optimized for 6GB+ VRAM, featuring a smart dataset analyzer and real-time previews.
Adaptive Hybrid Quantization Framework for deploying 7B+ LLMs on low-VRAM devices (e.g., GTX 1050). Features surgical block alignment and Numba-accelerated inference.
A ComfyUI workflow for low-VRAM users.
Taiwanese Hokkien (Taigi) speech-to-text transcriber - MediaTek Breeze-ASR-26 with faster-whisper, tuned for RTX 3050 4GB low-VRAM GPUs. Gradio UI, CLI, Docker, SRT/VTT/TXT/JSON.
🎥 Generate high-quality videos on budget hardware with the Wan 2.2 14B Low-VRAM Workflow for ComfyUI, optimized for smooth performance and quick results.
Lightweight 6GB VRAM Gradio web app with auto-installer for running AuraFlow locally — no cloud, no clutter.
Perkunas AI Training Platform is a memory-aware model training and serving system for serious language model experimentation under tight hardware limits. It combines streaming training, rich telemetry, guarded recovery, checkpoint export, and OpenAI-compatible serving.
Simple FP16 image upscaler supporting all GPUs, aimed at low- to mid-end hardware.
Notebooks and workflows configured to run Wan 2.2 Animate inference smoothly with ComfyUI on Kaggle T4 GPUs.
Lightweight Stable Diffusion engine with plugin-based pipelines, VRAM-safe execution, and full 4GB GPU support.
Audit local LLM function calling and agentic reliability. Visual tool-use benchmarking for quantized models on YOUR hardware.
A depth-guided, category-conditioned lightweight geometry completion network for indoor furniture reconstruction on consumer low-VRAM GPUs — 35M parameters, 88 MB, 0.5 s per object
Lightweight SDXL LoRA trainer optimized for 8GB VRAM GPUs. GUI with training, auto-captioning (Ollama) and image search (SearXNG).
A privacy-first Generative AI pipeline for prototyping 3D-style game assets on consumer hardware. Optimized for low-VRAM (4GB) GPUs using PyTorch, Diffusers, and Streamlit.
Tiny GPT-style training and local inference demo for consumer hardware.
A production-ready, frugal, sovereign AI system that orchestrates India's open-source language models to achieve state-of-the-art reasoning on consumer hardware through Test-Time Compute (TTC) and Cognitive Serialization.
Technical Showcase: 22B True-MoE Engine running on 6GB VRAM (GTX 1060). Demonstrates "Surgical" NF4 quantization, dynamic expert swapping, and the custom "Grace Hopper" pipeline.
🚀 Run modern 7B LLMs on legacy 4GB GPUs without crashes, breaking the VRAM barrier for developers facing GPU limitations.