Starred repositories
Qwen-Image-Lightning: Speed up Qwen-Image model with distillation
GGUF Quantization support for native ComfyUI models
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Simple, safe way to store and distribute tensors
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
Provides pre-built flash-attention package wheels for Linux and Windows platforms using GitHub Actions
Fast and memory-efficient exact attention
HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Faster Whisper transcription with CTranslate2
A community-maintained Python framework for creating mathematical animations.
🦜🔗 The platform for reliable agents.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
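Completely free: automatically obtains new accounts, resets the quota with one click, resolves the machine-ID issue, and automatically restores the full quota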
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
Robust Speech Recognition via Large-Scale Weak Supervision
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
GUI for a Vocal Remover that uses Deep Neural Networks.
Open-Source API Development Ecosystem • https://hoppscotch.io • Offline, On-Prem & Cloud • Web, Desktop & CLI • Open-Source Alternative to Postman, Insomnia
V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.
Astrofox is a motion graphics program that lets you turn audio into amazing videos.