-
Stanford University
- Stanford, California
- https://bchao1.github.io
- @BrianCChao
- in/brian-chao-85425415a
Starred repositories
Code repository for "Spectral Progressive Diffusion for Efficient Image and Video Generation"
Code release for "Foveated Diffusion: Efficient Spatially Adaptive Image and Video Generation"
bchao1 / sglang
Forked from sgl-project/sglangSGLang is a high-performance serving framework for large language models and multimodal models.
An agentic skills framework & software development methodology that works.
Ideogram 4: Open image model at the forefront of design
Wrapper of 50+ image matching models with a unified interface
Implementation of Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players
A PyTorch-native inference engine with cache, parallelism, quantization and cpu offload for DiTs.
SGLang is a high-performance serving framework for large language models and multimodal models.
ComfyUI Unnofficial Implementation of Spectral Progressive Diffusion for Efficient Image and Video Generation for Anima
GDM Science Skills to speed up agentic scientific workflows with better grounding and higher token efficiency. Integrate insights from AlphaGenome, AFDB, UniProt and 30+ other databases and tools.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Algorithm powering the For You feed on X
Efficient PyTorch Hessian eigendecomposition tools!
[CVPR 2026 Highlight] A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens
Official repository for “PixelGen: Improving Pixel Diffusion with Perceptual Loss”
[CVPR 2026] Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
[ICLR 2026] TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation
GenEval: An object-focused framework for evaluating text-to-image alignment
🔎 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
[NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving
The public source code of "FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling"
AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a querya…