-
Hunan University
- Changsha, China
- https://caoyunkang.github.io/
Lists (4)
Sort Name ascending (A-Z)
Starred repositories
[AAAI 2026] https://huggingface.co/csgaobb/AdaptCLIP
[DEIMv2] Real Time Object Detection Meets DINOv3
A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of-the-art methods, innovative applications, and key advanceme…
[AAAI 2026 Oral] LENS: Learning to Segment Anything with Unified Reinforced Reasoning
NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…
Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
Paper list for LLM/MLLM-based image segmentation
[AAAI 2026] The Official Implementation for "Anomagic: Crossmodal Prompt-driven Zero-shot Anomaly Generation"
[AAAI 2026 Oral] The Official Implementation for "Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects"
(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"
A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..
[Neurips 2025 Spotlight] Official repository for the paper: OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts
[AAAI 2026] Official Implementation for "AnoStyler: Text-Driven Localized Anomaly Generation via Lightweight Style Transfer"
Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)
[JMS 2025] A Comprehensive Survey for Real-World Industrial Surface Defect Detection: Challenges, Approaches, and Prospects (Journal of Manufacturing Systems)
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
We have summarised all 3D anomaly detection methods and datasets (still updating).
AnomalyControl: Learning Cross-modal Semantic Features for Controllable Anomaly Synthesis
RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.
Normal-Abnormal Guided Generalist Anomaly Detection (NeurIPS 2025)
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
[NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment
Coda and Data for NeurIPS 2025 paper "MuSLR: Multimodal Symbolic Logical Reasoning"
[NeurIPS 2025] PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer