Highlights
- Pro
Stars
Stable Diffusion web UI
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Python tool for converting files and office documents to Markdown.
AI agents running research on single-GPU nanochat training automatically
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
A generative world for general-purpose robotics & embodied AI learning.
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
Lets make video diffusion practical!
Wan: Open and Advanced Large-Scale Video Generative Models
Image annotation with Python. Supports polygon, rectangle, circle, line, point, and AI-assisted annotation.
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
Binance Exchange API python implementation for automated trading
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
😺 A tool designed to shorten steps needed to import and optimize models into VRChat. Compatible models are: MMD, XNALara, Mixamo, DAZ/Poser, Blender Rigify, Sims 2, Motion Builder, 3DS Max and pote…
This repository provides motion datasets collected by Bandai Namco Research Inc
Based on Talking-head-anime 3, works like Vtube Studio.
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
Official repository for BrickGPT, the first approach for generating physically stable toy brick models from text prompts.
VRM Importer, Exporter and Utilities for Blender 2.93 to 5.0
4DHumans: Reconstructing and Tracking Humans with Transformers
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
[SIGGRAPH 2025] One Model to Rig Them All: Diverse Skeleton Rigging with UniRig