Stars
Elevate your AI research writing, no more tedious polishing ✨
A collection of awesome video generation studies.
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
[CVPR2026 🎉] Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.
Enjoy the magic of Diffusion models!
Official PyTorch implementation of 6DRepNet: 6D Rotation representation for unconstrained head pose estimation.
State-of-the-art 2D and 3D Face Analysis Project
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
Let's make video diffusion practical!
Wan: Open and Advanced Large-Scale Video Generative Models
Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Solve Visual Understanding with Reinforced VLMs
Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection (CVPR 2025)
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
AugTarget data augmentation for infrared small target detection.
[AAAI2025] FedCFA: Alleviating Simpson’s Paradox in Model Aggregation with Counterfactual Federated Learning
✨ [AAAI 2025] Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification
[AAAI 2025] Official implementation of the paper "Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation"
Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirection Guidance" (AAAI 2025 Oral)
[AAAI 2025] Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback
Source code for AAAI'25 paper "Component-Level Segmentation for Oracle Bone Inscription Decipherment"
[AAAI'2025] The official implementation code of SIGMA
RaynorLEE / CATS
Forked from DataArcTech/CATS
[AAAI2025] Official code implementation of "Context-aware Inductive Knowledge Graph Completion with Latent Type Constraints and Subgraph Reasoning"