Stars
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
This project is a downstream fork of chatgpt-on-wechat that additionally integrates the LLMOps platform Dify and also supports gewechat, which is more stable than itchat.
[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
OpenAI-style API for open large language models: use LLMs just like ChatGPT! Supports LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3, etc.…
[ICCV 2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds
Official PyTorch implementation of One-Minute Video Generation with Test-Time Training
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.
[ICCV 2025] Official impl. of "MV-Adapter: Multi-view Consistent Image Generation Made Easy"
An Interactive Binary Patching Plugin for IDA Pro
Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop Streaming Platform for Self-Hosting, Containers, Kubernetes, or Cloud/HPC
Added vLLM support to IndexTTS for faster inference.
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
🚘 Library to query the status of your BMW or Mini from the ConnectedDrive portal
A unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerful framework.
MedResearcher-R1 is a deep research agent for medical scenarios, built on a knowledge-informed trajectory synthesis framework.