Starred repositories
SwinIR: Image Restoration Using Swin Transformer (official repository)
ComfyUI extension that enables multi-GPU processing locally, remotely and in the cloud
Production-ready platform for agentic workflow development.
[CVPR 2026] LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging
IronClaw is an Agent OS focused on privacy, security and extensibility
Fast, Sharp & Reliable Agentic Intelligence
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
A SOTA open-source image editing model that aims to provide performance comparable to closed-source models like GPT-4o and Gemini 2 Flash.
😼 Elegantly use a clash/mihomo-based proxy environment
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing and Generation' (CVPR 2026 Highlight)
CUDA accelerated rasterization of gaussian splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
YOLOv5 ONNX Runtime C++ inference code.
Achazwl / mlc
Forked from mlc-ai/mlc-llm. MiniCPM on Android platform.
A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone
A community-maintained Python framework for creating mathematical animations.
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
The reinforcement learning training code for AgiBot X1.
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Official inference repo for FLUX.1 models
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.