-
Michigan State University
- Seattle, USA
-
15:51
(UTC -07:00) - zeyuanyin.github.io
Highlights
- Pro
Lists (4)
Sort Name ascending (A-Z)
Starred repositories
Graphs that teach > graphs that impress. Turn any code, or knowledge base (Karpathy LLM wiki), into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claud…
Light Image Video Generation Inference Framework
Vero: An Open RL Recipe for General Visual Reasoning
[CVPR 2026 Highlight] MonoCoP: Unleashing the Power of Chain-of-Prediction for Monocular 3D Object Detection
[CVPR 2026] MonoIA: Towards Intrinsic-Aware Monocular 3D Object Detection
Mount Hugging Face Buckets and repos as local filesystems. No download, no copy, no waiting.
💻 vibe coding 2026 | Your first modern Coding course for beginners to master step by step.
[CVPR2026 🎉] Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.
Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement (ICLR2026)
A general fine-tuning kit geared toward image/video/audio diffusion models.
A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science.
This is a repository to collect training-free algorithms for visual generation and manipulation
A pipeline parallel training script for diffusion models.
Enjoy the magic of Diffusion models!
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
SGLang is a high-performance serving framework for large language models and multimodal models.
A curated list of recent efficient video generation methods.
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
🔥Hierarchical Fine-Grained Image Forgery Detection and Localization (CVPR23 + IJCV24)
A collection of paper/projects that trains flow matching model/policies via RL.
A collection of papers on diffusion models for 3D generation.
[ICCV 2025] Repository for A Quality-Guided Mixture of Score-fusion Experts Framework for Human Recognition
Official inference repo for FLUX.1 models
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
✅(已完结)超级全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】【大飞 大模型Agent】
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
This repository provides core code for managing large volumes of video footage, enabling content understanding, automatic tagging, and vector database storage. It integrates multimodal models and L…