Stars
[MM 2025] CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models
All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.
Microsandbox: Self-Hosted Platform for Secure Execution of Untrusted User or AI-Generated Code
Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop Streaming Platform for Self-Hosting, Containers, Kubernetes, or Cloud/HPC
Official Code for ICCV 2025 paper — Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation
Open-source Windows and Office activator featuring HWID, Ohook, TSforge, KMS38, and Online KMS activation methods, along with advanced troubleshooting.
Beautiful and accessible math in all browsers
Mobile-Agent: The Powerful GUI Agent Family
verl: Volcano Engine Reinforcement Learning for LLMs
MedResearcher-R1 is a deep research agent for medical scenarios, built on a knowledge-informed trajectory synthesis framework.
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
A drop-in replacement for react-markdown, designed for AI-powered streaming.
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
We write your reusable computer vision tools. 💜
Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle management capabilities from development, debugging, and evaluation…
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
Text-audio foundation model from Boson AI
Added vLLM support to IndexTTS for faster inference.
A unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerful framework.