Lists (2)
Sort Name ascending (A-Z)
Stars
📚 Freely available programming books
Stable Diffusion web UI
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
real time face swap and one-click video deepfake with only a single image
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The official gpt4free repository | various collection of powerful language models | opus 4.6 gpt 5.3 kimi 2.5 deepseek v3.2 gemini 3
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Portable file server with accelerated resumable uploads, dedup, WebDAV, SFTP, FTP, TFTP, zeroconf, media indexer, thumbnails++ all in one file
We write your reusable computer vision tools. 💜
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A generative world for general-purpose robotics & embodied AI learning.
Industry leading face manipulation platform
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Claude Code skill implementing Manus-style persistent markdown planning — the workflow pattern behind the $2B acquisition.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
A TTS model capable of generating ultra-realistic dialogue in one pass.
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
"AI-Trader: 100% Fully-Automated Agent-Native Trading"
WebUI extension for ControlNet
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
Official implementation of AnimateDiff.