Skip to content
View xiezhy6's full-sized avatar

Block or report xiezhy6

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SoulX-FlashTalk is the first 14B model to achieve sub-second start-up latency (0.87s) while maintaining a real-time throughput of 32 FPS on an 8xH800 node.

Python 1,139 108 Updated Mar 19, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 339,261 66,783 Updated Mar 28, 2026

This repository collects papers on Human-Interaction-Motion-Generation applications. We will update new papers irregularly.

262 16 Updated Oct 21, 2025

Offical Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

Python 905 51 Updated Mar 16, 2026

The official UniVerse-1 code.

Python 123 11 Updated Oct 13, 2025

[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 2,578 205 Updated Mar 17, 2026

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 73,238 10,044 Updated Mar 26, 2026

CLIP+MLP Aesthetic Score Predictor

Python 1,268 113 Updated Jul 1, 2024

Enjoy the magic of Diffusion models!

Python 12,114 1,179 Updated Mar 24, 2026

[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference ima…

Python 1,413 98 Updated Sep 21, 2025

A Python wrapper for the tesseract-ocr API

Python 2,161 261 Updated Mar 16, 2026

[ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official Weights and Demos provided.

Jupyter Notebook 495 50 Updated Aug 12, 2024

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 4,648 477 Updated Mar 3, 2026

The official code of Yume

Python 637 38 Updated Jan 14, 2026

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

1,403 44 Updated Mar 27, 2026

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,769 89 Updated Nov 28, 2025

[CVPR'25 Oral] Official implementation for "DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models"

Python 361 19 Updated Jun 13, 2025

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 7,049 493 Updated Mar 18, 2025

RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.

Python 535 81 Updated Feb 10, 2026

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,981 150 Updated Mar 25, 2026

[ECCV 2024] Official implementation of the paper "GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo Views"

Python 325 24 Updated Nov 23, 2025

DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation

40 Updated Aug 3, 2025

The official implementation of RealisDance

Python 611 28 Updated Jun 20, 2025

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,706 251 Updated Oct 17, 2025

Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens.

Python 277 34 Updated Feb 3, 2026

Official implementation for "Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches" (CVPR 2024)

Python 30 5 Updated Jul 4, 2024

Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.

Python 140 Updated May 23, 2025

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python 1,211 108 Updated Oct 15, 2025

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Python 848 57 Updated Sep 8, 2025
Next