Skip to content
View songcheng's full-sized avatar

Block or report songcheng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

FreeLighting: A Next-generation Image Relighting Model with Background Replica from Any Perspective Angle

Python 52 2 Updated Sep 16, 2025
17 Updated Jun 9, 2026

[CVPR 2026 Oral] Official implementation for ChordEdit: One-Step Low-Energy Transport for Image Editing

Python 307 13 Updated May 13, 2026

AdaRefSR is a novel reference-based one-step diffusion super-resolution framework. Paper was accepted by ICLR2026.

Python 54 Updated May 19, 2026

official github code for "SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing"

Python 133 4 Updated May 26, 2026

[CVPR26 Oral] MagicBokeh is the first unified method specifically designed for high-zoom bokeh rendering.

Python 42 1 Updated Jun 10, 2026

The best-benchmarked open-source AI memory system. And it's free.

Python 55,745 7,225 Updated Jun 15, 2026

🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!

Python 64,359 6,304 Updated Jun 7, 2026

Try X-Dub to sync any character in a video with any audio you like | Official repository for "From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping"

Python 203 3 Updated May 15, 2026

GPT-Image-2 API and Prompts

Python 16,736 1,698 Updated Jun 16, 2026

Open-Source Turn-Taking Detection Model and Dataset for Full-Duplex Spoken Dialogue Systems

Python 112 8 Updated Jan 25, 2026

Plug-and-play streaming semantic VAD for real-time full-duplex spoken dialogue systems.

Python 246 25 Updated Mar 20, 2026

Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

Python 894 38 Updated Dec 10, 2025

PengChengStarling is specifically designed for developing multilingual ASR models based on the icefall project, supporting a complete ASR pipeline that includes data processing, model training, inf…

Python 188 22 Updated Mar 6, 2025
Python 309 43 Updated Apr 15, 2026

High-Quality Voice Cloning TTS for 600+ Languages

Python 7,521 1,178 Updated Jun 11, 2026

A service to convert audio to facial blendshapes for lipsyncing and facial performances.

Python 306 49 Updated Mar 11, 2026

[CVPR 2026] FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning

Python 61 5 Updated Mar 26, 2026

sub-500ms latency phone agent orchestration

Python 661 67 Updated Mar 6, 2026

FireRed-OpenStoryline is an AI video editing agent that transforms manual editing into intention-driven directing through natural language interaction, LLM-powered planning, and precise tool orches…

Python 2,935 344 Updated May 7, 2026

Open-Source Frontier Voice AI

Python 49,402 5,506 Updated May 6, 2026

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 5 1 Updated Mar 13, 2026

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 1,317 203 Updated Jun 15, 2026

FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens per step for faster, high-quality speech synthesis, featuri…

Python 49 4 Updated Feb 17, 2026

Catalan TTS fine-tune of ZipVoice. Includes model weights and data preparation scripts.

Python 7 1 Updated Apr 24, 2026

[NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar

Python 76 8 Updated Mar 13, 2025

ICLR 2025 paper X-NeMo & Project X-Portrati2

Python 133 7 Updated Aug 7, 2025

KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution

Jupyter Notebook 389 42 Updated Jan 23, 2026

[CVPR 2026] PersonaLive! : Expressive Portrait Image Animation for Live Streaming

Python 3,331 466 Updated May 15, 2026

ARTalk generates realistic 3D head motions (lip sync, blinking, expressions, head poses) from audio in ⚡ real-time ⚡.

Python 134 26 Updated May 19, 2026
Next