Skip to content
View sxfly99's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Xidian University
  • Xi'an, China
  • 12:44 (UTC +08:00)

Block or report sxfly99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 155,439 13,548 Updated Nov 6, 2025

解决Cursor在免费订阅期间出现以下提示的问题: Your request has been blocked as our system has detected suspicious activity / You've reached your trial request limit. / Too many free trial accounts used on this machine.

Shell 25,003 3,040 Updated Oct 18, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,187 1,665 Updated Sep 24, 2025

Bring portraits to life!

Python 17,249 1,784 Updated Jun 14, 2025

A curated collection of fun and creative examples generated with Nano Banana🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the community's development…

15,494 1,617 Updated Sep 24, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,629 2,110 Updated Jul 17, 2025

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 11,833 1,109 Updated Aug 17, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,127 548 Updated Nov 3, 2025

⏰ Collaboratively track worldwide conference deadlines (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Rust 8,118 545 Updated Nov 5, 2025

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also …

Python 7,861 1,359 Updated Jul 21, 2024

Awesome curated collection of images and prompts generated by gemini-2.5-flash-image (aka Nano Banana) state-of-the-art image generation and editing model. Explore AI generated visuals created with…

JavaScript 7,327 735 Updated Sep 8, 2025

Open-source unified multimodal model

Python 5,252 454 Updated Oct 27, 2025

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 4,281 364 Updated Jun 15, 2025

This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.

4,036 352 Updated Jan 25, 2024

Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

Python 3,416 200 Updated Feb 23, 2025

一本系统地教你将深度学习模型的性能最大化的战术手册。

3,076 277 Updated May 27, 2023

Collect super-resolution related papers, data, repositories

2,905 365 Updated Oct 31, 2025

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 2,860 226 Updated Oct 20, 2025

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 2,482 184 Updated Feb 16, 2025

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,420 161 Updated Mar 3, 2025

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Python 2,169 56 Updated Nov 27, 2024

Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)

Python 1,927 138 Updated Oct 23, 2025

A comprehensive collection of IQA papers

TeX 1,367 81 Updated Oct 27, 2025

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 1,271 55 Updated Jul 23, 2025

③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.

Python 517 29 Updated Mar 12, 2025

IQA: Deep Image Structure and Texture Similarity Metric

Python 448 47 Updated May 22, 2020

[AAAI 2023] Exploring CLIP for Assessing the Look and Feel of Images

Python 447 20 Updated Oct 27, 2023

[IEEE TPAMI] A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends

373 29 Updated Nov 4, 2025

一些关于写论文的教程,防止犯一些低级错误

286 45 Updated Apr 1, 2025

CVPR 2025: Frequency Dynamic Convolution for Dense Image Prediction

Python 281 9 Updated Oct 27, 2025
Next