- Xi'an
Lists (3)
Sort Name ascending (A-Z)
Stars
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
OpenMMLab Detection Toolbox and Benchmark
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Graph Neural Network Library for PyTorch
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
PyTorch implementations of Generative Adversarial Networks.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
SwinIR: Image Restoration Using Swin Transformer (official repository)
Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN, SwinIR
Flops counter for neural networks in pytorch framework
Summary of related papers on visual attention. Related code will be released based on Jittor gradually.
[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.
Pytorch implementation of various Knowledge Distillation (KD) methods.
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
[CVPR 2021] Multi-Stage Progressive Image Restoration. SOTA results for Image deblurring, deraining, and denoising.
"Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement" (ICCV 2023) & (NTIRE 2024 Runner-Up)
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024)
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image …
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention
Real-World Super-Resolution via Kernel Estimation and Noise Injection
[CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution