Lists (27)
Sort Name ascending (A-Z)
3D objecet detection
3weijia
AIGC
AIGC_RL
CV
Deep Learning
frp
🔮 Future ideas
llm
大模型相关仓库MLLM
NR-IQA
open-mmlab
pyTorch
初学pyTorchredis
VLM
低光照综述
医学分割模型
四元数
图像融合
底层训练框架
数据集
机场物流-vue-django
模型剪枝
知识蒸馏
算法面经
细胞检测
蒸馏相关
Stars
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
Glance: Accelerating Diffusion Models with 1 Sample
Ongoing research training transformer models at scale
A general fine-tuning kit geared toward image/video/audio diffusion models.
[CVPR2023] Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation
Official Implementation of Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling
Kandinsky 5.0: A family of diffusion models for Video & Image generation
Official PyTorch Implementation of "Flow Map Distillation Without Data"
MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
https://little-misfit.github.io/GRAG-Image-Editing/
[SIGGRAPH Asia 2025] Official Implementation of "ConsistEdit: Highly Consistent and Precise Training-free Visual Editing"
ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation
Fast and Universal 3D reconstruction model for versatile tasks
This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing and Generation''
Efficient vision foundation models for high-resolution generation and perception.
⚡Batch Face Processing for Fast Modern Research, including face detection, face alignment, face reconstruction, head pose estimation, face parsing
Face analysis tools for modern research, equipped with state-of-the-art Face Parsing and Face Alignment
PyTorch code and models for the DINOv2 self-supervised learning method.
[Official Code] Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction
[WACV 2024 Oral] - ARNIQA: Learning Distortion Manifold for Image Quality Assessment