ChenyangSi

107 followers · 10 following

http://chenyangsi.top/

Stars

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.

Python 92,990 10,476 Updated Nov 7, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,515 6,481 Updated Nov 7, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 27,563 2,536 Updated Nov 7, 2025

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,263 420 Updated Nov 7, 2025

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

TypeScript 41,647 2,732 Updated Nov 6, 2025

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,702 5,060 Updated Nov 6, 2025

Stability-AI / generative-models

Generative Models by Stability AI

Python 26,571 2,976 Updated Nov 3, 2025

google-research / google-research

Google Research

Jupyter Notebook 36,677 8,234 Updated Oct 30, 2025

mayuelala / Awesome-Controllable-Video-Generation

[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"

536 35 Updated Oct 28, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 5,258 455 Updated Oct 27, 2025

Vchitect / VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,300 85 Updated Oct 16, 2025

showlab / Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, and various other applications.

5,177 319 Updated Oct 15, 2025

Gen-Verse / MMaDA

[NeurIPS 2025] MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,471 71 Updated Oct 13, 2025

naganandy / graph-based-deep-learning-literature

links to conference publications in graph-based deep learning

Jupyter Notebook 4,980 783 Updated Oct 5, 2025

Yutong-Zhou-cv / Awesome-Text-to-Image

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

2,401 204 Updated Sep 23, 2025

zhaoxin94 / awesome-domain-adaptation

A collection of AWESOME things about domain adaptation

5,360 884 Updated Sep 10, 2025

stepfun-ai / Step1X-Edit

A SOTA open-source image editing model that aims to provide performance comparable to closed-source models like GPT-4o and Gemini 2 Flash.

Python 1,722 79 Updated Sep 8, 2025

amueller / word_cloud

A little word cloud generator in Python

Python 10,458 2,338 Updated Aug 31, 2025

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 24,603 1,807 Updated Jul 31, 2025

Vchitect / TACA

[ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers

Python 39 4 Updated Jul 23, 2025

leoShen917 / DoF-Gaussian

(CVPR 2025) DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting

Python 60 4 Updated Jul 14, 2025

wusize / OpenUni

Python 162 6 Updated Jun 27, 2025

menyifang / MIMO

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

Python 1,554 68 Updated Jun 19, 2025

magic-research / GETAvatar

[ICCV 2023] GETAvatar: Generative Textured Meshes for Animatable Human Avatars

Python 113 9 Updated Jun 18, 2025

Vchitect / DCM

[ICCV2025] DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation

Python 195 11 Updated Jun 8, 2025

chongzhou96 / EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Jupyter Notebook 1,072 50 Updated May 24, 2025

wdndev / llm_interview_note

Mainly records knowledge and interview questions for large language model (LLM) algorithm (application) engineers

HTML 10,734 1,096 Updated Apr 30, 2025

Vchitect / Vchitect-2.0

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Python 916 23 Updated Mar 17, 2025

baaivision / Emu3

Next-Token Prediction is All You Need

Python 2,249 88 Updated Mar 17, 2025

LTH14 / fractalgen

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,189 65 Updated Feb 25, 2025