Skip to content
View wangzheallen's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report wangzheallen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fast and memory-efficient classical machine learning operators

Python 509 37 Updated Jun 2, 2026

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,374 71 Updated Mar 5, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,700 568 Updated Nov 10, 2025

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,618 931 Updated Aug 21, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 8,622 792 Updated May 31, 2024

🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"

Python 320 25 Updated May 17, 2024

OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,966 129 Updated Dec 4, 2025

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,959 400 Updated Feb 27, 2025

Fast Diffusion Models with Transformers

Python 948 121 Updated Aug 17, 2025

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,963 513 Updated Dec 13, 2025

Modeling, training, eval, and inference code for OLMo

Python 6,555 770 Updated Nov 24, 2025

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,953 884 Updated Jul 18, 2024

Building a quick conversation-based search demo with Lepton AI.

TypeScript 8,091 1,005 Updated Dec 2, 2025

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Python 1,439 88 Updated Sep 7, 2023

Character Animation (AnimateAnyone, Face Reenactment)

Python 3,505 296 Updated May 31, 2024

[ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large Language Models to Understand Point Clouds

Python 1,026 58 Updated May 15, 2026

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Python 3,294 492 Updated Apr 29, 2026

An open-source framework for training large multimodal models.

Python 4,106 321 Updated Aug 31, 2024

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Jupyter Notebook 4,439 730 Updated Jun 22, 2024

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 667 37 Updated Oct 22, 2024

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

Python 3,632 647 Updated May 15, 2024

A simple, performant and scalable Jax LLM!

Python 2,323 535 Updated Jun 16, 2026

High-speed Large Language Model Serving for Local Deployment

C++ 9,565 580 Updated May 11, 2026

[T-PAMI 2025] PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm

Python 374 8 Updated Sep 30, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,929 875 Updated Jun 10, 2024

[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Python 816 96 Updated Jun 26, 2024

Code for paper: FUTR3D: a unified sensor fusion framework for 3d detection

Python 349 46 Updated Jul 6, 2023

Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux)

Python 4,952 478 Updated Jul 17, 2023

BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).

Python 575 103 Updated Nov 20, 2023

[ICLR 2023] DiffMimic: Efficient Motion Mimicking with Differentiable Physics https://arxiv.org/abs/2304.03274

Python 307 21 Updated Jan 22, 2025
Next