(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"

Python 574 32 Updated Feb 4, 2026

yongliang-wu / DFT

[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 554 22 Updated Jan 4, 2026

jiwoogit / StyleID

[CVPR 2024 Highlight] Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer

Python 470 38 Updated Dec 16, 2024

open-mmlab / StyleShot

StyleShot: A SnapShot on Any Style. A model that can transfer any style onto any content, generating high-quality personalized stylized images without per-image fine-tuning!

Python 458 41 Updated Jun 30, 2025

deepcs233 / Visual-CoT

[NeurIPS'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 440 21 Updated Dec 22, 2024

Picsart-AI-Research / HD-Painter

[ICLR 2025] HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models

Python 359 23 Updated Mar 14, 2024

DCDmllm / Cheetah

Python 352 27 Updated May 25, 2024

OPPO-Mente-Lab / Subject-Diffusion

Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning

Python 317 13 Updated Jul 11, 2024

MS-Diffusion / MS-Diffusion

[ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance

Python 309 13 Updated Jul 30, 2025

RQ-Wu / LAMP

[CVPR 2024] LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation

Python 283 13 Updated Apr 22, 2024

j-min / CLIP-Caption-Reward

PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)

Python 246 27 Updated Jun 10, 2025

hqhQAQ / MIP-Adapter

[AAAI 2025] Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation

Python 172 14 Updated Jul 1, 2025

FeiElysia / ViECap

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023

Python 164 6 Updated Sep 9, 2024

aimagelab / LLaVA-MORE

[ICCVW 25] LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning

Python 159 13 Updated Aug 8, 2025

ADaM-BJTU / OpenRFT

OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

Python 156 3 Updated Dec 24, 2024

IDEA-Research / Rex-Thinker

Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning

Python 146 7 Updated Jun 30, 2025

zjunlp / Deco

[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

Python 138 11 Updated Sep 11, 2025

LinLLLL / MaskST

[ICLR 2025] The official implementation of Less is More: Masking Elements in Image Condition Features Avoids Content Leakages in Style Transfer Diffusion Models

Python 105 Updated Jul 3, 2025

AtsuMiyai / LoCoOp

[NeurIPS 2023] LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning

Python 102 4 Updated Jul 5, 2025

mzhaoshuai / RLCF

[ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models.

Python 101 2 Updated Oct 20, 2025

Lin Zhu LinLLLL

Stars

AUTOMATIC1111 / stable-diffusion-webui

deepseek-ai / DeepSeek-V3

haotian-liu / LLaVA

QwenLM / Qwen

IDEA-Research / GroundingDINO

om-ai-lab / VLM-R1

NVlabs / describe-anything

Jingkang50 / OpenOOD

CodeGoat24 / UnifiedReward

mit-han-lab / fastcomposer

iSEE-Laboratory / LLMDet