54 stars written in Python

Stable Diffusion web UI

Python 162,120 30,212 Updated Mar 2, 2026

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,649 2,758 Updated Aug 12, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 20,919 1,769 Updated Mar 5, 2026

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,957 1,016 Updated Aug 12, 2024

Solve Visual Understanding with Reinforced VLMs

Python 5,926 378 Updated Mar 12, 2026

[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python 1,476 90 Updated Jun 26, 2025

Benchmarking Generalized Out-of-Distribution Detection

Python 1,040 173 Updated Dec 1, 2025

Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex

Python 755 41 Updated Mar 19, 2026

[IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention

Python 716 43 Updated Jan 10, 2025

(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"

Python 574 32 Updated Feb 4, 2026

[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 554 22 Updated Jan 4, 2026

[CVPR 2024 Highlight] Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer

Python 470 38 Updated Dec 16, 2024

StyleShot: A SnapShot on Any Style. A model that transfers any style to any content, generating high-quality, personalized stylized images without per-image fine-tuning!

Python 458 41 Updated Jun 30, 2025

[NeurIPS'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 440 21 Updated Dec 22, 2024

[ICLR 2025] HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models

Python 359 23 Updated Mar 14, 2024
Python 352 27 Updated May 25, 2024

Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning

Python 317 13 Updated Jul 11, 2024

[ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance

Python 309 13 Updated Jul 30, 2025

[CVPR 2024] LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation

Python 283 13 Updated Apr 22, 2024

PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)

Python 246 27 Updated Jun 10, 2025

[AAAI 2025] Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation

Python 172 14 Updated Jul 1, 2025

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023

Python 164 6 Updated Sep 9, 2024

[ICCVW 25] LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning

Python 159 13 Updated Aug 8, 2025

OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

Python 156 3 Updated Dec 24, 2024

Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning

Python 146 7 Updated Jun 30, 2025

[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

Python 138 11 Updated Sep 11, 2025

[ICLR2025] The official implementation of Less is More: Masking Elements in Image Condition Features Avoids Content Leakages in Style Transfer Diffusion Models

Python 105 Updated Jul 3, 2025

[NeurIPS2023] LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning

Python 102 4 Updated Jul 5, 2025

[ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models.

Python 101 2 Updated Oct 20, 2025