-
19:51
(UTC -12:00)
Lists (1)
Sort Name ascending (A-Z)
Stars
Official Implementation of Paper Transfer between Modalities with MetaQueries
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark
A Protein Large Language Model for Multi-Task Protein Language Processing
[CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback
Official repository for the UAE paper, unified-GRPO, and unified-Bench
[ACM MM 2025] HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation
Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.
DFNet: Enhance Absolute Pose Regression with Direct Feature Matching (ECCV 2022)
Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP
Odysseus: Playground of LLM Sequence Parallelism
Spiking-DDPG trains an SNN for energy-efficient mapless navigation on Intel's Loihi neuromorphic processor.
[CVPR 2024 Highlight] LMF (Latent Modulated Function for Computational Optimal Continuous Image Representation)
The official code for "TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation"
Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization.
[ICCV2025]LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models
(NeurIPS 2025) Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation
Synthetic data generator for image, video and 3D models
A ComfyUI node for adding nice text on image.
Minimal PyTorch implementation of TP, SP, and FSDP