Skip to content
View sxfly99's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Xidian University
  • Xi'an, China
  • 21:04 (UTC +08:00)

Block or report sxfly99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
85 results for source starred repositories
Clear filter

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 155,463 13,556 Updated Nov 6, 2025

解决Cursor在免费订阅期间出现以下提示的问题: Your request has been blocked as our system has detected suspicious activity / You've reached your trial request limit. / Too many free trial accounts used on this machine.

Shell 25,003 3,040 Updated Oct 18, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,189 1,665 Updated Sep 24, 2025

Bring portraits to life!

Python 17,255 1,785 Updated Jun 14, 2025

A curated collection of fun and creative examples generated with Nano Banana🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the community's development…

15,565 1,627 Updated Sep 24, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,636 2,111 Updated Jul 17, 2025

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 11,834 1,109 Updated Aug 17, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,139 548 Updated Nov 3, 2025

⏰ Collaboratively track worldwide conference deadlines (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Rust 8,119 545 Updated Nov 5, 2025

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also …

Python 7,863 1,359 Updated Jul 21, 2024

Open-source unified multimodal model

Python 5,253 454 Updated Oct 27, 2025

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 4,282 365 Updated Jun 15, 2025

This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.

4,037 352 Updated Jan 25, 2024

Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

Python 3,416 200 Updated Feb 23, 2025

一本系统地教你将深度学习模型的性能最大化的战术手册。

3,076 277 Updated May 27, 2023

Collect super-resolution related papers, data, repositories

2,905 365 Updated Oct 31, 2025

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 2,863 226 Updated Oct 20, 2025

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 2,481 184 Updated Feb 16, 2025

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,420 161 Updated Mar 3, 2025

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Python 2,169 56 Updated Nov 27, 2024

Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)

Python 1,927 138 Updated Oct 23, 2025

A comprehensive collection of IQA papers

TeX 1,370 81 Updated Oct 27, 2025

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 1,272 55 Updated Jul 23, 2025

③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.

Python 517 29 Updated Mar 12, 2025

[AAAI 2023] Exploring CLIP for Assessing the Look and Feel of Images

Python 450 20 Updated Oct 27, 2023

IQA: Deep Image Structure and Texture Similarity Metric

Python 448 47 Updated May 22, 2020

[IEEE TPAMI] A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends

374 29 Updated Nov 4, 2025

一些关于写论文的教程,防止犯一些低级错误

286 45 Updated Apr 1, 2025

CVPR 2025: Frequency Dynamic Convolution for Dense Image Prediction

Python 284 9 Updated Oct 27, 2025

①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.

Jupyter Notebook 278 13 Updated Aug 12, 2024
Next