Starred repositories
remic-othr / OpenMIBOOD
Forked from Jingkang50/OpenOODMedical Imaging Benchmarks for Out-Of-Distribution Detection
Faster Whisper transcription with CTranslate2
Robust Speech Recognition via Large-Scale Weak Supervision
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
VisioFirm: Cross-Platform AI-assisted Annotation Tool for Computer Vision
Code of Concept Matching with Agent for Out-of-Distribution Detection [AAAI-2025]
misraya / RbA
Forked from NazirNayal8/RbAOfficial code for "RbA: Segmenting Unknown Regions Rejected by All" (ICCV 2023)
Unsupervised Semantic Segmentation by Distilling Feature Correspondences
Code for the paper "Placing Objects in Context via Inpainting for Out-of-distribution Segmentation", ECCV 2024
[CVPR 2023] Official implementation of the paper: MP-Former: Mask-Piloted Transformer for Image Segmentation
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
Official code for RbA: Segmenting Unknown Regions Rejected by All (ICCV 2023)
[ICCV'23 Oral] Unmasking Anomalies in Road-Scene Segmentation
CVPR2022 (Oral) - Rethinking Semantic Segmentation: A Prototype View
[ICCV 2021] Deep Metric Learning for Open World Semantic Segmentation
[CVPR '24] Official repository for Deformable One-shot Face Stylization via DINO Semantic Guidance
[CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
[CVPR 2024] Open-Set Domain Adaptation for Semantic Segmentation
[AAAI 2025] Official implementation of the paper "Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation"
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
✨✨Latest Advances on Multimodal Large Language Models
[ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model
An update-to-date list for papers related with label-noise representation learning is here.