Stars
Stable Diffusion web UI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
The official gpt4free repository | various collection of powerful language models | o4, o3 and deepseek r1, gpt-4.1, gemini 2.5
High-Resolution Image Synthesis with Latent Diffusion Models
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero
AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
Open-Sora: Democratizing Efficient Video Production for All
Generative Models by Stability AI
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Stable Diffusion with Core ML on Apple Silicon
Lets make video diffusion practical!
Generate 3D objects conditioned on text or images
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Pythonic AI generation of images and videos
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
SkyReels-V2: Infinite-length Film Generative model
Stable diffusion for real-time music generation
GPT4V-level open-source multi-modal model based on Llama3-8B
Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs