Skip to content
View lim142857's full-sized avatar

Block or report lim142857

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Consistent Autoregressive Video Generation with Long Context

88 2 Updated Feb 6, 2026

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Python 782 78 Updated Jun 10, 2026

[CVPR'26 Highlight] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Python 608 51 Updated Jun 1, 2026

[ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos

Python 530 29 Updated Jun 4, 2026

[NeurIPS 2025 Spotlight] Demo implementation of MoCha Towards Movie-Grade Talking Character Synthesis

Python 15 2 Updated Dec 27, 2025

A version of verl to support diverse tool use [TMLR 2026]

Python 1,001 83 Updated Jun 8, 2026

Official Repo for MoCha Towards Movie-Grade Talking Character Synthesis

Python 62 4 Updated Dec 27, 2025

Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]

145 2 Updated Jan 27, 2025

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions

Python 55 Updated Jul 3, 2024

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,951 883 Updated Jul 18, 2024

Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]

Jupyter Notebook 654 49 Updated Oct 29, 2024

Data and Code for Program of Thoughts [TMLR 2023]

Python 316 27 Updated May 15, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 3,298 202 Updated Oct 31, 2024

[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Python 952 64 Updated Nov 13, 2024

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Python 967 65 Updated Nov 13, 2024
Python 8,677 519 Updated Oct 9, 2024

Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)

Python 183 17 Updated Oct 1, 2024

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Python 943 57 Updated Jul 6, 2024

Generative Models by Stability AI

Python 27,189 3,096 Updated Dec 16, 2025

[ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers

Python 20 2 Updated Apr 16, 2024

[ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features

Python 195 16 Updated Sep 5, 2023

An open source implementation of CLIP.

Python 13,916 1,285 Updated Jun 15, 2026

Fashion 200K dataset used in paper "Automatic Spatially-aware Fashion Concept Discovery."

70 11 Updated Mar 10, 2022

[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion

Python 198 11 Updated Jul 31, 2025

[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning

Jupyter Notebook 106 9 Updated Jul 18, 2024

Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models

135 5 Updated Oct 17, 2025
Python 170 41 Updated Mar 7, 2022

[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".

Python 411 15 Updated Feb 20, 2025

Demo code for CVPR2023 paper "Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers"

Python 15 3 Updated Jul 4, 2023
Next