1170300714

Follow

1170300714

Follow

15 followers · 24 following

Achievements

Achievements

Lists (3)

Sort

🔮 Future ideas

✨ Inspiration

🚀 My stack

Stars

Kevinz-code / SeVa

[MM2024, oral] "Self-Supervised Visual Preference Alignment" https://arxiv.org/abs/2404.10501

Python 58 4 Updated Jul 26, 2024

tgxs002 / HPSv2

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Jupyter Notebook 608 24 Updated May 24, 2024

xiechenxi99 / DNAEdit_code

[NeurIPS 2025 Spotlight] Official implementation for DNAEdit: Direct Noise Alignment for Text-Guided Rectified Flow Editing

Python 17 Updated Nov 3, 2025

KwaiVGI / VideoAlign

[NeurIPS 2025] Improving Video Generation with Human Feedback

Python 320 7 Updated Sep 24, 2025

PKU-YuanGroup / ImgEdit

[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark

Python 220 2 Updated Nov 5, 2025

open-mmlab / mim

MIM Installs OpenMMLab Packages

Python 375 71 Updated Nov 24, 2023

Ogannesson / ashare-llm-analyst

基于Python的A股智能分析工具，结合大语言模型提供数据驱动的投资建议和市场洞察

Python 300 73 Updated Nov 5, 2025

shiyu-coder / Kronos

Kronos: A Foundation Model for the Language of Financial Markets

Python 8,755 1,818 Updated Nov 5, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,630 2,111 Updated Jul 17, 2025

roboflow / supervision

We write your reusable computer vision tools. 💜

Python 35,809 2,991 Updated Nov 5, 2025

microsoft / LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Python 2,055 261 Updated Jun 4, 2025

kohya-ss / musubi-tuner

Python 1,369 175 Updated Nov 5, 2025

THUDM / CogKit

Finetuning and inference tools for the CogView4 and CogVideoX model series.

Python 101 12 Updated May 14, 2025

nightrome / cocostuff

The official homepage of the COCO-Stuff dataset.

Shell 893 145 Updated Sep 9, 2022

Vadbeg / diffusers-inpainting

Diffusers pipeline for inpainting with any available finetune

Python 34 4 Updated Jul 8, 2023

PKU-YuanGroup / Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 12,068 1,070 Updated Oct 29, 2025

AILab-CVC / VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,991 395 Updated Jul 10, 2024

Stability-AI / generative-models

Generative Models by Stability AI

Python 26,564 2,975 Updated Nov 3, 2025

thu-ml / RIFLEx

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)

Python 734 71 Updated May 13, 2025

aigc-apps / VideoX-Fun

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,512 111 Updated Nov 5, 2025

Gar-b-age / CookLikeHOC

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工，非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》，并做归纳、编辑与整理。CookLikeHOC.

JavaScript 21,915 2,202 Updated Oct 17, 2025

LAION-AI / LAION-5B-WatermarkDetection

Python 126 16 Updated Jan 10, 2023

EvolvingLMMs-Lab / LLaVA-OneVision-1.5

Fully Open Framework for Democratized Multimodal Training

Python 603 41 Updated Nov 2, 2025

bcmi / Image-Harmonization-Dataset-iHarmony4

[CVPR 2020] The first large-scale public benchmark dataset for image harmonization. The code used in our paper "DoveNet: Deep Image Harmonization via Domain Verification", CVPR2020. Useful for imag…

MATLAB 799 96 Updated May 24, 2025

NVIDIA / vid2vid

Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.

Python 8,702 1,212 Updated May 17, 2022

Vicky0522 / I2VEdit

[SIGGRAPH Asia 2024] I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models

Python 73 3 Updated Jun 23, 2025

THUDM / INFTY

INFTY Engine: An Optimization Toolkit to Support Continual AI

Python 365 9 Updated Sep 13, 2025

JCruan519 / VM-UNet

(ACM TOMM) This is the official code repository for "VM-UNet: Vision Mamba UNet for Medical Image Segmentation".

Python 739 42 Updated Sep 3, 2025

AlfredQin / DB-SAM

Python 23 3 Updated Oct 4, 2024

zhaoziheng / SAT

[npj Digital Medicine] The official repository for "Large-Vocabulary Segmentation for Medical Images with Text Prompts"

Python 241 17 Updated Nov 5, 2025