Stars
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Google Research
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A generative world for general-purpose robotics & embodied AI learning.
Generative Models by Stability AI
Official inference repo for FLUX.1 models
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 论文/代码/解读/直播合集,极市团队整理
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
A little word cloud generator in Python
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Official repo for consistency models.
Taming Transformers for High-Resolution Image Synthesis
A curated list of awesome self-supervised methods
A collection of AWESOME things about domain adaptation
A curated list of recent diffusion models for video generation, editing, and various other applications.
links to conference publications in graph-based deep learning
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
FILM: Frame Interpolation for Large Motion, In ECCV 2022.
A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
DeepLab v3+ model in PyTorch. Support different backbones.
Pytorch implementation of Self-Attention Generative Adversarial Networks (SAGAN)