Skip to content
View bchao1's full-sized avatar
🚶‍♂️
I need to focus.
🚶‍♂️
I need to focus.

Block or report bchao1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

848 results for source starred repositories
Clear filter

Official inference repo for FLUX.2 models

Python 1,240 62 Updated Dec 1, 2025

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,528 84 Updated Nov 10, 2025

[NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation

Python 657 24 Updated Nov 27, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,561 551 Updated Nov 10, 2025

[arXiv 2025] Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers

Python 50 4 Updated Aug 8, 2025

Official implementation of ICML2025 paper "ToMA: Token Merge with Attention for Diffusion Models"

Python 6 Updated Aug 6, 2025

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

C++ 33,573 8,044 Updated Aug 3, 2024

HunyuanVideo-1.5: A leading lightweight video generation model

Python 2,006 97 Updated Dec 19, 2025

OpenMMLab Pose Estimation Toolbox and Benchmark.

Python 7,190 1,438 Updated Aug 4, 2025

EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video

Python 90 2 Updated Aug 20, 2025

A curated list of egocentric (first-person) vision and related area resources

303 34 Updated Oct 14, 2024

[Nips 2025] EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation

Python 126 Updated Jul 31, 2025

[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"

603 38 Updated Nov 11, 2025

Get cookies.txt, NEVER send information outside.

JavaScript 787 83 Updated Oct 7, 2025

A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.

Python 13,038 2,527 Updated Aug 15, 2024

Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"

Python 308 7 Updated Mar 30, 2025

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,568 120 Updated Dec 9, 2025

Export iMessage data + run iMessage Diagnostics

Rust 4,582 224 Updated Dec 16, 2025

A sparse attention kernel supporting mix sparse patterns

C++ 408 38 Updated Dec 16, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 4,311 606 Updated Dec 20, 2025

Tile primitives for speedy kernels

Cuda 3,008 217 Updated Dec 9, 2025

[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention

Python 599 31 Updated Dec 9, 2025

[NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation

Python 565 31 Updated Nov 11, 2025

[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.

Cuda 839 71 Updated Dec 17, 2025

Next-Token Prediction is All You Need

Python 2,266 91 Updated Nov 19, 2025
Python 622 30 Updated May 24, 2024

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,708 128 Updated Dec 19, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,909 1,503 Updated Dec 17, 2025

Official PyTorch Implementation for Dual-Process Image Generation, ICCV 2025

Jupyter Notebook 115 7 Updated Aug 29, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,279 1,447 Updated Nov 28, 2025
Next