- Santiago, Chile.
-
08:54
(UTC -04:00)
Lists (2)
Sort Name ascending (A-Z)
Stars
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
deepbeepmeep / Wan2GP
Forked from Wan-Video/Wan2.1A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
Repository with diffusers recipes by model
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.
Low-latency AI engine for mobile devices & wearables
Fixes AI pixel art or sprite web uploads
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
DiffusersServer es un servidor de inferencia basado en FastAPI y uvicorn que permite generar imágenes a partir de texto (Text-to-Image) utilizando modelos de difusión.
A reimplementation of Stable Diffusion 3.5 in pure PyTorch
Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) , UltraViCo (ICLR 2026) and UltraImage
Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.
dgenerate is a scriptable command line tool (and library) for generating images and animation sequences using stable diffusion and related techniques, with an accompanying GUI scripting environment.
Scalable and memory-optimized training of diffusion models
[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 🔥 🔥 Merged into offical pipelines of diffusers.)
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
An out-of-the-box inference acceleration engine for Diffusion and DiT models
Agent S: an open agentic framework that uses computers like a human
[CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation" . Project page: https://bizgen-msra.github.io/