Skip to content
View ChenyangSi's full-sized avatar
😊
😊

Block or report ChenyangSi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
56 results for source starred repositories written in Python
Clear filter

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 93,145 10,496 Updated Nov 9, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,717 5,061 Updated Nov 6, 2025

Let us control diffusion models!

Python 33,269 2,980 Updated Feb 25, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,529 6,486 Updated Nov 9, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 27,570 2,534 Updated Nov 9, 2025

Generative Models by Stability AI

Python 26,581 2,978 Updated Nov 3, 2025

Official inference repo for FLUX.1 models

Python 24,612 1,808 Updated Jul 31, 2025

Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习

Python 14,148 3,848 Updated Feb 18, 2025

A little word cloud generator in Python

Python 10,457 2,338 Updated Aug 31, 2025

Official repo for consistency models.

Python 6,432 434 Updated Mar 22, 2024

Graph Convolutional Networks in PyTorch

Python 5,374 1,228 Updated Sep 20, 2020

Open-source unified multimodal model

Python 5,262 456 Updated Oct 27, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,268 421 Updated Nov 7, 2025

A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.

Python 3,058 1,055 Updated Nov 25, 2022

DeepLab v3+ model in PyTorch. Support different backbones.

Python 2,995 779 Updated Aug 4, 2024

Pytorch implementation of Self-Attention Generative Adversarial Networks (SAGAN)

Python 2,597 477 Updated Apr 22, 2024

2018/2019/校招/春招/秋招/自然语言处理(NLP)/深度学习(Deep Learning)/机器学习(Machine Learning)/C/C++/Python/面试笔记,此外,还包括创建者看到的所有机器学习/深度学习面经中的问题。 除了其中 DL/ML 相关的,其他与算法岗相关的计算机知识也会记录。 但是不会包括如前端/测试/JAVA/Android等岗位中有关的问题。

Python 2,394 520 Updated Dec 4, 2018

Automated Deep Learning: Neural Architecture Search Is Not the End (a curated list of AutoDL resources and an in-depth analysis)

Python 2,321 319 Updated Sep 26, 2022

Next-Token Prediction is All You Need

Python 2,250 88 Updated Mar 17, 2025

Emu Series: Generative Multimodal Models from BAAI

Python 1,754 86 Updated Sep 27, 2024

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 1,724 80 Updated Sep 8, 2025

A state-of-the-art semi-supervised method for image recognition

Python 1,641 342 Updated Oct 8, 2020

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

Python 1,555 68 Updated Jun 19, 2025

[NeurIPS 2025] MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,478 71 Updated Oct 13, 2025

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,301 86 Updated Oct 16, 2025

PyTorch implemented C3D, R3D, R2Plus1D models for video activity recognition.

Python 1,232 256 Updated Dec 27, 2023

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,189 65 Updated Feb 25, 2025
Python 1,011 154 Updated Nov 29, 2023

[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Python 938 62 Updated Nov 13, 2024

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Python 916 23 Updated Mar 17, 2025
Next