sxfly99

Follow

🎯

Focusing

Xiangfei Sheng sxfly99

🎯

Focusing

Follow

8 followers · 7 following

Xidian University
Xi'an, China
23:23 (UTC +08:00)

Lists (4)

Sort

AGIQA

IAA

IQA

MLLM

Stars

57 results for source starred repositories written in Python

OpenBMB / MiniCPM-V

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,193 1,664 Updated Sep 24, 2025

KwaiVGI / LivePortrait

Bring portraits to life!

Python 17,260 1,783 Updated Jun 14, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,644 2,112 Updated Jul 17, 2025

XPixelGroup / BasicSR

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also …

Python 7,865 1,359 Updated Jul 21, 2024

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 5,257 455 Updated Oct 27, 2025

sail-sg / EditAnything

Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

Python 3,416 200 Updated Feb 23, 2025

chaofengc / IQA-PyTorch

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 2,869 227 Updated Oct 20, 2025

dvlab-research / LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 2,483 185 Updated Feb 16, 2025

zai-org / CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,420 161 Updated Mar 3, 2025

lucidrains / lion-pytorch

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Python 2,169 56 Updated Nov 27, 2024

Yuliang-Liu / Monkey

Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)

Python 1,927 138 Updated Oct 23, 2025

IDEA-Research / DINO-X-API

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 1,272 55 Updated Jul 23, 2025

Q-Future / Q-Align

③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.

Python 517 29 Updated Mar 12, 2025

IceClear / CLIP-IQA

[AAAI 2023] Exploring CLIP for Assessing the Look and Feel of Images

Python 450 20 Updated Oct 27, 2023

dingkeyan93 / DISTS

IQA: Deep Image Structure and Texture Similarity Metric

Python 449 47 Updated May 22, 2020

Linwei-Chen / FDConv

CVPR 2025: Frequency Dynamic Convolution for Dense Image Prediction

Python 285 9 Updated Oct 27, 2025

yipoh / AesBench

An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs.

Python 252 6 Updated Feb 4, 2025

Q-Future / Q-Instruct

②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.

Python 231 10 Updated Aug 12, 2024

zwx8981 / LIQE

[CVPR2023] Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective

Python 227 13 Updated Jan 10, 2025

PKU-YuanGroup / ImgEdit

[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark

Python 223 2 Updated Nov 5, 2025

bytedance / Q-Insight

[NeurIPS 2025 Spotlight] Q-Insight: Understanding Image Quality via Visual Reinforcement Learning

Python 199 6 Updated Oct 10, 2025

h4nwei / SPAQ

[CVPR'20] Official SPAQ & Implementation

Python 189 33 Updated Jan 17, 2024

Y-Research-SBU / PosterGen

Official Repository for PosterGen

Python 175 14 Updated Oct 17, 2025

pandayuanyu / generative-photography

[CVPR 2025 Highlight] Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis

Python 156 8 Updated Oct 3, 2025

PKU-YuanGroup / Edit-R1

Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback

Python 152 3 Updated Oct 28, 2025

TianheWu / VisualQuality-R1

[NeurIPS 2025 Spotlight] VisualQuality-R1 is the first open-sourced NR-IQA model can accurately describe and rate the image quality.

Python 121 4 Updated Oct 15, 2025

DYEvaLab / EvalMuse

Python 118 12 Updated Mar 25, 2025

z-x-yang / NS-Outpainting

Very Long Natural Scenery Image Prediction by Outpainting, ICCV2019, TensorFlow

Python 91 16 Updated Feb 2, 2021

jiajunsi / RCML

Reliable Conflictive Multi-view Learning

Python 87 7 Updated Mar 24, 2024

Vchitect / ShotBench

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Python 84 3 Updated Sep 12, 2025