Skip to content
View asomoza's full-sized avatar

Organizations

@huggingface

Block or report asomoza

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Perfect Green Screen Keys

Python 9,571 579 Updated Apr 8, 2026

AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods

TypeScript 34,035 3,835 Updated Apr 9, 2026

A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.

Python 5,033 723 Updated Apr 9, 2026

ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Python 5,380 535 Updated Sep 10, 2025

SD.Next Quantization Engine

Python 105 9 Updated Apr 10, 2026

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,443 305 Updated Jan 5, 2026

Repository with diffusers recipes by model

Python 11 3 Updated Apr 9, 2026

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Python 8,261 964 Updated Apr 10, 2026
Python 10,911 739 Updated Feb 9, 2026

RAM is all you need

Python 266 29 Updated Nov 28, 2025

Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.

Python 146 18 Updated May 29, 2025

Low-latency AI engine for mobile devices & wearables

C 4,607 350 Updated Apr 10, 2026

Development client for Mellon

TypeScript 18 4 Updated Feb 2, 2026

Speak Friend and Enter

Python 270 17 Updated Mar 2, 2026

Fixes AI pixel art or sprite web uploads

Python 401 30 Updated Apr 1, 2026

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 14,309 2,114 Updated Apr 4, 2026
Python 18 6 Updated Jul 27, 2025

DiffusersServer es un servidor de inferencia basado en FastAPI y uvicorn que permite generar imágenes a partir de texto (Text-to-Image) utilizando modelos de difusión.

Python 3 Updated Sep 15, 2025

A reimplementation of Stable Diffusion 3.5 in pure PyTorch

Python 703 33 Updated Jun 14, 2025

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) , UltraViCo (ICLR 2026) and UltraImage

Python 800 75 Updated Mar 8, 2026

Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.

Python 1,156 88 Updated Apr 10, 2026

dgenerate is a scriptable command line tool (and library) for generating images and animation sequences using stable diffusion and related techniques, with an accompanying GUI scripting environment.

Python 43 1 Updated Oct 15, 2025

Scalable and memory-optimized training of diffusion models

Python 1,348 139 Updated Apr 8, 2026

[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 🔥 🔥 Merged into offical pipelines of diffusers.)

Python 279 14 Updated Jan 7, 2026
Python 1,776 257 Updated Mar 6, 2026

Towards Human-Sounding Speech

Python 6,069 517 Updated Dec 5, 2025

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,719 386 Updated Apr 9, 2026

An out-of-the-box inference acceleration engine for Diffusion and DiT models

C++ 60 1 Updated Mar 21, 2025

Agent S: an open agentic framework that uses computers like a human

Python 10,799 1,259 Updated Feb 21, 2026

[CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation" . Project page: https://bizgen-msra.github.io/

Python 302 40 Updated Apr 5, 2025
Next