Brian Chao bchao1

🚶‍♂️

I need to focus.

Stanford Ph.D. student. Researcher at Meta Reality Labs. I work on spatial computing.

223 followers · 56 following

Stanford University
Stanford, California
https://bchao1.github.io
@BrianCChao
in/brian-chao-85425415a

Starred repositories

848 results for source starred repositories

black-forest-labs / flux2

Official inference repo for FLUX.2 models

Python 1,240 62 Updated Dec 1, 2025

FoundationVision / Infinity

[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,528 84 Updated Nov 10, 2025

FoundationVision / InfinityStar

[NeurIPS 2025 Oral] Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation

Python 657 24 Updated Nov 27, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,561 551 Updated Nov 10, 2025

ignoww / RALU

[arXiv 2025] Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers

Python 50 4 Updated Aug 8, 2025

wenboluu / ToMA

Official implementation of ICML2025 paper "ToMA: Token Merge with Attention for Diffusion Models"

Python 6 Updated Aug 6, 2025

CMU-Perceptual-Computing-Lab / openpose

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

C++ 33,573 8,044 Updated Aug 3, 2024

Tencent-Hunyuan / HunyuanVideo-1.5

HunyuanVideo-1.5: A leading lightweight video generation model

Python 2,006 97 Updated Dec 19, 2025

open-mmlab / mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

Python 7,190 1,438 Updated Aug 4, 2025

apple / ml-egodex

EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video

Python 90 2 Updated Aug 20, 2025

Sid2697 / awesome-egocentric-vision

A curated list of egocentric (first-person) vision and related area resources

303 34 Updated Oct 14, 2024

JeffWang987 / EgoVid

[NeurIPS 2025] EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation

Python 126 Updated Jul 31, 2025

mayuelala / Awesome-Controllable-Video-Generation

[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"

603 38 Updated Nov 11, 2025

kairi003 / Get-cookies.txt-LOCALLY

Get cookies.txt, NEVER send information outside.

JavaScript 787 83 Updated Oct 7, 2025

pytube / pytube

A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.

Python 13,038 2,527 Updated Aug 15, 2024

ZHU-Zhiyu / NVS_Solver

Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"

Python 308 7 Updated Mar 30, 2025

nv-tlabs / vipe

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,568 120 Updated Dec 9, 2025

ReagentX / imessage-exporter

Export iMessage data + run iMessage Diagnostics

Rust 4,582 224 Updated Dec 16, 2025

mit-han-lab / Block-Sparse-Attention

A sparse attention kernel supporting mix sparse patterns

C++ 408 38 Updated Dec 16, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 4,311 606 Updated Dec 20, 2025

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 3,008 217 Updated Dec 9, 2025

svg-project / Sparse-VideoGen

[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention

Python 599 31 Updated Dec 9, 2025

mit-han-lab / radial-attention

[NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation

Python 565 31 Updated Nov 11, 2025

thu-ml / SpargeAttn

[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.

Cuda 839 71 Updated Dec 17, 2025

baaivision / Emu3

Next-Token Prediction is All You Need

Python 2,266 91 Updated Nov 19, 2025

hehao13 / CameraCtrl

Python 622 30 Updated May 24, 2024

aigc-apps / VideoX-Fun

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,708 128 Updated Dec 19, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,909 1,503 Updated Dec 17, 2025

g-luo / dual_process

Official PyTorch Implementation for Dual-Process Image Generation, ICCV 2025

Jupyter Notebook 115 7 Updated Aug 29, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,279 1,447 Updated Nov 28, 2025

stereo-matching

light-field

information-gathering