Skip to content
View zeyuanyin's full-sized avatar
🔭
Working
🔭
Working

Highlights

  • Pro

Block or report zeyuanyin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Graphs that teach > graphs that impress. Turn any code, or knowledge base (Karpathy LLM wiki), into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claud…

TypeScript 14,947 1,392 Updated May 17, 2026

Light Image Video Generation Inference Framework

Python 2,268 199 Updated May 15, 2026

Vero: An Open RL Recipe for General Visual Reasoning

Python 121 10 Updated Apr 19, 2026

[CVPR 2026 Highlight] MonoCoP: Unleashing the Power of Chain-of-Prediction for Monocular 3D Object Detection

Python 14 Updated Mar 31, 2026

[CVPR 2026] MonoIA: Towards Intrinsic-Aware Monocular 3D Object Detection

Python 17 Updated Mar 31, 2026

Mount Hugging Face Buckets and repos as local filesystems. No download, no copy, no waiting.

Rust 726 46 Updated May 11, 2026

💻 vibe coding 2026 | Your first modern Coding course for beginners to master step by step.

JavaScript 12,243 1,153 Updated May 17, 2026

[CVPR2026 🎉] Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.

Python 766 52 Updated Feb 21, 2026

Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement (ICLR2026)

Python 295 11 Updated Mar 24, 2026

A general fine-tuning kit geared toward image/video/audio diffusion models.

Python 2,831 280 Updated May 15, 2026

A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science.

C++ 210 33 Updated May 13, 2026

This is a repository to collect training-free algorithms for visual generation and manipulation

253 10 Updated Mar 9, 2026

A pipeline parallel training script for diffusion models.

Python 1,953 272 Updated May 15, 2026

Enjoy the magic of Diffusion models!

Python 12,416 1,203 Updated May 15, 2026

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Python 845 44 Updated Apr 14, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 27,928 5,955 Updated May 17, 2026

A curated list of recent efficient video generation methods.

68 3 Updated Oct 7, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,099 2,078 Updated Mar 27, 2026

🔥Hierarchical Fine-Grained Image Forgery Detection and Localization (CVPR23 + IJCV24)

Python 300 27 Updated Jul 1, 2025

A collection of paper/projects that trains flow matching model/policies via RL.

394 14 Updated Dec 25, 2025

A collection of papers on diffusion models for 3D generation.

1,252 61 Updated Jan 16, 2026

[ICCV 2025] Repository for A Quality-Guided Mixture of Score-fusion Experts Framework for Human Recognition

Python 18 3 Updated Sep 29, 2025

Official inference repo for FLUX.1 models

Python 25,548 1,891 Updated Jul 31, 2025

🔥Deepfake + LLM (CVPR25 Oral)

Python 112 7 Updated Jul 11, 2025

A niche toolkit for 3D computer vision tasks.

Python 322 26 Updated Mar 29, 2026

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,464 57 Updated Dec 16, 2025

✅(已完结)超级全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】【大飞 大模型Agent】

Jupyter Notebook 21,080 2,412 Updated Apr 27, 2026

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 811 52 Updated May 17, 2026

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 6,112 437 Updated May 16, 2026

This repository provides core code for managing large volumes of video footage, enabling content understanding, automatic tagging, and vector database storage. It integrates multimodal models and L…

Python 20 7 Updated Mar 25, 2025
Next