SoulX-FlashTalk is the first 14B model to achieve sub-second start-up latency (0.87s) while maintaining a real-time throughput of 32 FPS on an 8xH800 node.

Python 1,117 107 Updated Mar 19, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 329,295 63,960 Updated Mar 22, 2026

This repository collects papers on Human-Interaction-Motion-Generation applications. We will update new papers irregularly.

259 16 Updated Oct 21, 2025

Official Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

Python 898 50 Updated Mar 16, 2026

The official UniVerse-1 code.

Python 123 11 Updated Oct 13, 2025

[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 2,576 205 Updated Mar 17, 2026

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 72,793 10,010 Updated Mar 19, 2026

CLIP+MLP Aesthetic Score Predictor

Python 1,268 113 Updated Jul 1, 2024

Enjoy the magic of Diffusion models!

Python 12,055 1,174 Updated Mar 20, 2026

[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference ima…

Python 1,411 98 Updated Sep 21, 2025

A Python wrapper for the tesseract-ocr API

Python 2,161 261 Updated Mar 16, 2026

[ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official Weights and Demos provided.

Jupyter Notebook 495 50 Updated Aug 12, 2024

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 4,635 477 Updated Mar 3, 2026

The official code of Yume

Python 635 38 Updated Jan 14, 2026

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

1,372 42 Updated Mar 22, 2026

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,768 89 Updated Nov 28, 2025

[CVPR'25 Oral] Official implementation for "DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models"

Python 355 19 Updated Jun 13, 2025

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 7,049 490 Updated Mar 18, 2025

RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.

Python 532 79 Updated Feb 10, 2026

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,976 149 Updated Mar 20, 2026

[ECCV 2024] Official implementation of the paper "GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo Views"

Python 324 24 Updated Nov 23, 2025

DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation

39 Updated Aug 3, 2025

The official implementation of RealisDance

Python 610 28 Updated Jun 20, 2025

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,702 250 Updated Oct 17, 2025

Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens.

Python 278 34 Updated Feb 3, 2026

Official implementation for "Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches" (CVPR 2024)

Python 30 5 Updated Jul 4, 2024

Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.

Python 140 Updated May 23, 2025

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python 1,212 108 Updated Oct 15, 2025

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Python 847 57 Updated Sep 8, 2025