Lists (2)
Sort Name ascending (A-Z)
Stars
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
real time face swap and one-click video deepfake with only a single image
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
We write your reusable computer vision tools. 💜
Instant voice cloning by MIT and MyShell. Audio foundation model.
A powerful MCP toolkit for coding, providing semantic retrieval and editing capabilities - the IDE for your agent
A TTS model capable of generating ultra-realistic dialogue in one pass.
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
The NCA Toolkit API eliminates monthly subscription fees by consolidating common API functionalities into a single FREE API. Designed for businesses, creators, and developers, it streamlines advanc…
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Official code for the paper "LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes".
Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
Azur Lane bot based on azurlane-auto. Discord: https://discord.gg/vCFxDen.