-
Johns Hopkins University <- Tsinghua
- Baltimore, United States
-
00:24
(UTC -05:00) - https://caiyuanhao1998.github.io/
Stars
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Wan: Open and Advanced Large-Scale Video Generative Models
Enjoy the magic of Diffusion models!
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
Count the MACs / FLOPs of your PyTorch model.
Official implementations for paper: Anydoor: zero-shot object-level image customization
The project is an official implement of our ECCV2018 paper "Simple Baselines for Human Pose Estimation and Tracking(https://arxiv.org/abs/1804.06208)"
PyTorch version of the paper 'Enhanced Deep Residual Networks for Single Image Super-Resolution' (CVPRW 2017)
label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
Fast and accurate human pose estimation in PyTorch. Contains implementation of "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose" paper.
"Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement" (ICCV 2023) & (NTIRE 2024 Runner-Up)
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
A toolbox for spectral compressive imaging reconstruction including MST (CVPR 2022), CST (ECCV 2022), DAUHST (NeurIPS 2022), BiSCI (NeurIPS 2023), HDNet (CVPR 2022), MST++ (CVPRW 2022), etc.
ICCV 2023-2025 Papers: Discover cutting-edge research from ICCV 2023-25, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included.…
Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]
Learning Image-adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-time
[NeurIPS 24] PromptFix: You Prompt and We Fix the Photo
整理 pytorch 单机多 GPU 训练方法与原理
Official implementation of "LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching"
[NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"
[CVPR 2023] Official implementation of the paper "One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer"
A Guidance on PyTorch Coding Style Based on Kaggle Dogs vs. Cats
"Structure-Aware Sparse-View X-ray 3D Reconstruction" (CVPR 2024) - A Toolbox for CT reconstruction and X-ray Novel View Synthesis
LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction (ICCV 2025)
A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.