asomoza

Álvaro Somoza asomoza

137 followers · 6 following

@huggingface
Santiago, Chile.

Achievements

x4 x3 x2

Organizations

Lists (2)

Diffusers

2 repositories

Stable Diffusion

Stars

nv-tlabs / kimodo

Official implementation of Kimodo, a kinematic motion diffusion model for high-quality human(oid) motion generation.

Python 2,175 224 Updated Apr 25, 2026

intel / auto-round

A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers.

Python 1,070 116 Updated Apr 30, 2026

wildminder / awesome-ltx2

All available LTX-2 models, encoders, workflows, LoRAs for ComfyUI

323 26 Updated Apr 28, 2026

nikopueringer / CorridorKey

Perfect Green Screen Keys

Python 12,382 751 Updated Apr 22, 2026

badlogic / pi-mono

AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods

TypeScript 42,921 5,031 Updated Apr 30, 2026

deepbeepmeep / Wan2GP

Forked from Wan-Video/Wan2.1

A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.

Python 5,487 800 Updated Apr 29, 2026

hzwer / ECCV2022-RIFE

ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Python 5,408 535 Updated Sep 10, 2025

Disty0 / sdnq

SD.Next Quantization Engine

Python 108 10 Updated Apr 16, 2026

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,484 310 Updated Jan 5, 2026

asomoza / diffusers-recipes

Repository with diffusers recipes by model

Python 13 5 Updated Apr 29, 2026

OpenBMB / VoxCPM

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Python 16,247 1,930 Updated Apr 28, 2026

Tongyi-MAI / Z-Image

Python 11,137 751 Updated Feb 9, 2026

lodestone-rock / RamTorch

RAM is all you need

Python 274 29 Updated Apr 11, 2026

sayakpaul / nanoDiT

Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.

Python 147 18 Updated May 29, 2025

cactus-compute / cactus

Low-latency AI engine for mobile devices & wearables

C 4,695 369 Updated Apr 28, 2026

cubiq / Mellon-client

Development client for Mellon

TypeScript 18 4 Updated Feb 2, 2026

cubiq / Mellon

Speak Friend and Enter

Python 279 19 Updated Mar 2, 2026

KennethJAllen / proper-pixel-art

Fixes AI pixel art or sprite web uploads

Python 417 31 Updated Apr 25, 2026

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 14,406 2,116 Updated Apr 20, 2026

newgenai79 / sd-diffuser-webui

Python 18 6 Updated Jul 27, 2025

FredyRivera-dev / DiffusersServer

DiffusersServer is an inference server based on FastAPI and uvicorn that generates images from text (Text-to-Image) using diffusion models.

Python 3 Updated Sep 15, 2025

yousef-rafat / miniDiffusion

A reimplementation of Stable Diffusion 3.5 in pure PyTorch

Python 702 33 Updated Jun 14, 2025

thu-ml / DiT-Extrapolation

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025), UltraViCo (ICLR 2026) and UltraImage

Python 807 75 Updated Mar 8, 2026

PrunaAI / pruna

Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.

Python 1,167 88 Updated Apr 28, 2026

Teriks / dgenerate

dgenerate is a scriptable command line tool (and library) for generating images and animation sequences using stable diffusion and related techniques, with an accompanying GUI scripting environment.

Python 44 1 Updated Oct 15, 2025

huggingface / finetrainers

Scalable and memory-optimized training of diffusion models

Python 1,358 139 Updated Apr 8, 2026

lzyhha / VisualCloze

[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 🔥 🔥 Merged into official pipelines of diffusers.)

Python 281 14 Updated Jan 7, 2026

kohya-ss / musubi-tuner

Python 1,812 263 Updated Apr 20, 2026

canopyai / Orpheus-TTS

Towards Human-Sounding Speech

Python 6,112 520 Updated Dec 5, 2025

facebookincubator / AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,716 388 Updated Apr 9, 2026