wangzheallen

🎯

Focusing

Zhe Wang wangzheallen

189 followers · 131 following

Computer Vision Researcher
SF, US
https://wangzheallen.github.io

Stars

FlashML-org / flashlib

Fast and memory-efficient classical machine learning operators

Python 509 37 Updated Jun 2, 2026

tianweiy / DMD2

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,374 71 Updated Mar 5, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,700 568 Updated Nov 10, 2025

HumanAIGC / EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,618 931 Updated Aug 21, 2024

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 8,622 792 Updated May 31, 2024

DiT-3D / DiT-3D

🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"

Python 320 25 Updated May 17, 2024

siliconflow / onediff

OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,966 129 Updated Dec 4, 2025

facebookresearch / jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,959 400 Updated Feb 27, 2025

chuanyangjin / fast-DiT

Fast Diffusion Models with Transformers

Python 948 121 Updated Aug 17, 2025

gaomingqi / Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,963 513 Updated Dec 13, 2025

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 6,555 770 Updated Nov 24, 2025

instantX-research / InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,953 884 Updated Jul 18, 2024

leptonai / search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

TypeScript 8,091 1,005 Updated Dec 2, 2025

wenhaochai / StableVideo

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Python 1,439 88 Updated Sep 7, 2023

MooreThreads / Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Python 3,505 296 Updated May 31, 2024

InternRobotics / PointLLM

[ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large Language Models to Understand Point Clouds

Python 1,026 58 Updated May 15, 2026

NVlabs / FoundationPose

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Python 3,294 492 Updated Apr 29, 2026

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python 4,106 321 Updated Aug 31, 2024

MarkFzp / mobile-aloha

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Jupyter Notebook 4,439 730 Updated Jun 22, 2024

csuhan / OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 667 37 Updated Oct 22, 2024

MarkFzp / act-plus-plus

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

Python 3,632 647 Updated May 15, 2024

AI-Hypercomputer / maxtext

A simple, performant and scalable Jax LLM!

Python 2,323 535 Updated Jun 16, 2026

Tiiny-AI / PowerInfer

High-speed Large Language Model Serving for Local Deployment

C++ 9,565 580 Updated May 11, 2026

OpenGVLab / PonderV2

[T-PAMI 2025] PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm

Python 374 8 Updated Sep 30, 2025

artidoro / qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,929 875 Updated Jun 10, 2024

exiawsh / StreamPETR

[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Python 816 96 Updated Jun 26, 2024

Tsinghua-MARS-Lab / futr3d

Code for paper: FUTR3D: a unified sensor fusion framework for 3d detection

Python 349 46 Updated Jul 6, 2023

OpenGVLab / DragGAN

Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (full-featured DragGAN implementation with an online demo and local deployment; code and models are fully open source; supports Windows, macOS, Linux)

Python 4,952 478 Updated Jul 17, 2023

DerryHub / BEVFormer_tensorrt

BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).

Python 575 103 Updated Nov 20, 2023

jiawei-ren / diffmimic

[ICLR 2023] DiffMimic: Efficient Motion Mimicking with Differentiable Physics https://arxiv.org/abs/2304.03274

Python 307 21 Updated Jan 22, 2025
