Stars
12 Lessons to Get Started Building AI Agents
Official implementation of "SplitFlow: Flow Decomposition for Inversion-Free Text-to-Image Editing (NeurIPS 2025)"
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..
[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)
[arXiv 2025] Delta Velocity Rectified Flow for Text-to-Image Editing
[ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
Recent weakly supervised semantic segmentation paper
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
This repository contains demos I made with the Transformers library by HuggingFace.
Official code for Phase Concentration and Shortcut Suppression for Weakly Supervised Semantic Segmentation, ECCV2024
Official respository for ECCV24 paper "Diffusion-Guided Weakly Supervised Semantic Segmentation"
Official code for Class Tokens Infusion for Weakly Supervised Semantic Segmentation, CVPR2024
Adversarial Erasing Framework via Triplet with Gated Pyramid Pooling Layer for Weakly Supervised Semantic Segmentation, ECCV2022
Official repository for CVPR 2023 paper: WSSS via Adversarial Learning of Classifier and Reconstructor
Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
[CVPR 2022] CLIMS: Cross Language Image Matching for Weakly Supervised Semantic Segmentation
some mixture of experts architecture implementations
A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
Official repository of "Event-guided Deblurring of Unknown Exposure Time Videos" ECCV 2022 paper(Oral).
A comprehensive list of weakly supervised semantic segmentation (WSSS) works from 2014 to 2022.