The official implementation of StereoPilot

Python 104 3 Updated Dec 19, 2025
Python 17 Updated Dec 11, 2025
Python 14 1 Updated Dec 6, 2025

[ACM MM 2022] UConNet: Unsupervised Controllable Network for Image and Video Deraining

Python 12 3 Updated Apr 7, 2024

The official implementation of the paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization".

Python 457 40 Updated Dec 10, 2025

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…

Python 1,065 78 Updated Dec 20, 2025

[SIGGRAPH 2025] Official code of the paper "Cobra: Efficient Line Art COlorization with BRoAder References".

Python 246 19 Updated Dec 10, 2025

[CVPR 2026] Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny co…

Python 1,475 120 Updated Dec 23, 2025

[ICCV 2025] LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models

Python 86 3 Updated Sep 8, 2025

Tiny AutoEncoder for Hunyuan Video (and other video models)

Python 334 11 Updated Mar 14, 2026

Generative Omnimatte (CVPR 2025)

Python 169 15 Updated Jun 3, 2025

This is the official implementation of our paper: "MiniMax-Remover: Taming Bad Noise Helps Video Object Removal"

Python 549 53 Updated Jul 27, 2025

🕹️ Explore cutting-edge techniques in game generation

66 1 Updated Mar 16, 2026

This is the official implementation of our Señorita-2M [Weights and Dataset]: A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists

Python 104 1 Updated Apr 9, 2025

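2024.06.19 This project uses Chinese-CLIP to build a text-to-image and image-to-image search page, intended to help users get started quickly with cross-modal retrieval tasks. The code uses about 190k images (189,585) from the MUGE dataset as the gallery data. The project provides feature extraction, retrieval, and UI code.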

Python 22 1 Updated Jun 20, 2024

Media computing practice assignment: image-text cross-modal search

Python 40 10 Updated Dec 4, 2020

[ICLR 2025] BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities

Python 144 Updated Jan 26, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,982 150 Updated Mar 25, 2026

Paint by Inpaint: Learning to Add Image Objects by Removing Them First

Python 115 2 Updated Jun 7, 2025

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,903 89 Updated Jan 8, 2026

The official repository of "Video assistant towards large language model makes everything easy"

Python 232 15 Updated Dec 24, 2024

[NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence

Python 149 5 Updated Sep 10, 2024

[ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

Python 77 8 Updated Aug 13, 2024

Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.

Python 324 11 Updated Sep 24, 2024

LLM101n: Let's build a Storyteller

36,621 2,002 Updated Aug 1, 2024

[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Python 299 17 Updated Jul 17, 2024

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,528 501 Updated Mar 22, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,861 871 Updated Jun 10, 2024

Code for ACL 2022 paper "BERT Learns to Teach: Knowledge Distillation with Meta Learning".

Python 86 17 Updated Aug 4, 2022

This repository contains datasets and baselines for benchmarking Chinese text recognition.

Python 504 51 Updated Dec 2, 2022