Skip to content
View sxfly99's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Xidian University
  • Xi'an, China
  • 23:23 (UTC +08:00)

Block or report sxfly99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
57 results for source starred repositories written in Python
Clear filter

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,193 1,664 Updated Sep 24, 2025

Bring portraits to life!

Python 17,260 1,783 Updated Jun 14, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,644 2,112 Updated Jul 17, 2025

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also …

Python 7,865 1,359 Updated Jul 21, 2024

Open-source unified multimodal model

Python 5,257 455 Updated Oct 27, 2025

Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

Python 3,416 200 Updated Feb 23, 2025

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 2,869 227 Updated Oct 20, 2025

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 2,483 185 Updated Feb 16, 2025

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,420 161 Updated Mar 3, 2025

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Python 2,169 56 Updated Nov 27, 2024

Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)

Python 1,927 138 Updated Oct 23, 2025

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 1,272 55 Updated Jul 23, 2025

③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.

Python 517 29 Updated Mar 12, 2025

[AAAI 2023] Exploring CLIP for Assessing the Look and Feel of Images

Python 450 20 Updated Oct 27, 2023

IQA: Deep Image Structure and Texture Similarity Metric

Python 449 47 Updated May 22, 2020

CVPR 2025: Frequency Dynamic Convolution for Dense Image Prediction

Python 285 9 Updated Oct 27, 2025

An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs.

Python 252 6 Updated Feb 4, 2025

②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.

Python 231 10 Updated Aug 12, 2024

[CVPR2023] Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective

Python 227 13 Updated Jan 10, 2025

[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark

Python 223 2 Updated Nov 5, 2025

[NeurIPS 2025 Spotlight] Q-Insight: Understanding Image Quality via Visual Reinforcement Learning

Python 199 6 Updated Oct 10, 2025

[CVPR'20] Official SPAQ & Implementation

Python 189 33 Updated Jan 17, 2024

Official Repository for PosterGen

Python 175 14 Updated Oct 17, 2025

[CVPR 2025 Highlight] Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis

Python 156 8 Updated Oct 3, 2025

Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback

Python 152 3 Updated Oct 28, 2025

[NeurIPS 2025 Spotlight] VisualQuality-R1 is the first open-sourced NR-IQA model can accurately describe and rate the image quality.

Python 121 4 Updated Oct 15, 2025
Python 118 12 Updated Mar 25, 2025

Very Long Natural Scenery Image Prediction by Outpainting, ICCV2019, TensorFlow

Python 91 16 Updated Feb 2, 2021

Reliable Conflictive Multi-view Learning

Python 87 7 Updated Mar 24, 2024

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Python 84 3 Updated Sep 12, 2025
Next