
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

TypeScript 41,445 2,711 Updated Nov 4, 2025

FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)

1,888 78 Updated Dec 24, 2024

Open-source unified multimodal model

Python 5,250 454 Updated Oct 27, 2025

[ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers

Python 39 4 Updated Jul 23, 2025

[ICCV2025] DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation

Python 195 11 Updated Jun 8, 2025
Python 162 6 Updated Jun 27, 2025

[NeurIPS 2025] MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,470 71 Updated Oct 13, 2025

[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"

532 35 Updated Oct 28, 2025

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 1,718 78 Updated Sep 8, 2025

(CVPR 2025) DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting

Python 60 4 Updated Jul 14, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,188 65 Updated Feb 25, 2025

A collection of knowledge and interview questions for large language model (LLM) algorithm (application) engineers

HTML 10,705 1,092 Updated Apr 30, 2025

The official implementation of "RepVideo: Rethinking Cross-Layer Representation for Video Generation"

Python 121 6 Updated Jan 25, 2025

Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.

137 5 Updated Aug 21, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 27,544 2,531 Updated Nov 5, 2025

[ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Python 250 10 Updated Dec 27, 2024

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

Python 1,554 67 Updated Jun 19, 2025

Next-Token Prediction is All You Need

Python 2,245 88 Updated Mar 17, 2025

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Python 916 23 Updated Mar 17, 2025

Official inference repo for FLUX.1 models

Python 24,599 1,808 Updated Jul 31, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,256 419 Updated Nov 3, 2025

Emu Series: Generative Multimodal Models from BAAI

Python 1,754 86 Updated Sep 27, 2024

[CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts

Python 305 12 Updated Jun 9, 2024

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Jupyter Notebook 1,071 50 Updated May 24, 2025

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

Python 532 21 Updated Jan 18, 2024

[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Python 940 62 Updated Nov 13, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,298 85 Updated Oct 16, 2025

[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.

Python 506 27 Updated Mar 7, 2024
Jupyter Notebook 3,405 324 Updated May 14, 2024

[ICCV 2023] GETAvatar: Generative Textured Meshes for Animatable Human Avatars

Python 113 9 Updated Jun 18, 2025