[MM2024, oral] "Self-Supervised Visual Preference Alignment" https://arxiv.org/abs/2404.10501

Python 58 4 Updated Jul 26, 2024

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Jupyter Notebook 608 24 Updated May 24, 2024

[NeurIPS 2025 Spotlight] Official implementation for DNAEdit: Direct Noise Alignment for Text-Guided Rectified Flow Editing

Python 17 Updated Nov 3, 2025

[NeurIPS 2025] Improving Video Generation with Human Feedback

Python 320 7 Updated Sep 24, 2025

[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark

Python 220 2 Updated Nov 5, 2025

MIM Installs OpenMMLab Packages

Python 375 71 Updated Nov 24, 2023

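A Python-based intelligent analysis tool for A-shares that combines large language models to provide data-driven investment advice and market insights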

Python 300 73 Updated Nov 5, 2025

Kronos: A Foundation Model for the Language of Financial Markets

Python 8,758 1,819 Updated Nov 5, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,631 2,110 Updated Jul 17, 2025

We write your reusable computer vision tools. 💜

Python 35,814 2,992 Updated Nov 5, 2025

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Python 2,055 261 Updated Jun 4, 2025
Python 1,371 175 Updated Nov 5, 2025

Finetuning and inference tools for the CogView4 and CogVideoX model series.

Python 101 12 Updated May 14, 2025

The official homepage of the COCO-Stuff dataset.

Shell 893 145 Updated Sep 9, 2022

Diffusers pipeline for inpainting with any available finetune

Python 34 4 Updated Jul 8, 2023

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 12,068 1,070 Updated Oct 29, 2025

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,991 395 Updated Jul 10, 2024

Generative Models by Stability AI

Python 26,565 2,975 Updated Nov 3, 2025

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)

Python 734 71 Updated May 13, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,512 111 Updated Nov 5, 2025

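🥢 Cook like Laoxiangji (老乡鸡) 🐔. The main part was completed in 2024; this is not an official Laoxiangji repository. The text comes from the "Laoxiangji Dish Traceability Report" (《老乡鸡菜品溯源报告》) and has been summarized, edited, and organized. CookLikeHOC.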

JavaScript 21,915 2,202 Updated Oct 17, 2025

Fully Open Framework for Democratized Multimodal Training

Python 603 41 Updated Nov 2, 2025

[CVPR 2020] The first large-scale public benchmark dataset for image harmonization. The code used in our paper "DoveNet: Deep Image Harmonization via Domain Verification", CVPR2020. Useful for imag…

MATLAB 799 96 Updated May 24, 2025

PyTorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.

Python 8,702 1,212 Updated May 17, 2022

[SIGGRAPH Asia 2024] I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models

Python 73 3 Updated Jun 23, 2025

INFTY Engine: An Optimization Toolkit to Support Continual AI

Python 365 9 Updated Sep 13, 2025

(ACM TOMM) This is the official code repository for "VM-UNet: Vision Mamba UNet for Medical Image Segmentation".

Python 739 42 Updated Sep 3, 2025
Python 23 3 Updated Oct 4, 2024