Skip to content
View caiyuanhao1998's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report caiyuanhao1998

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
125 stars written in Python
Clear filter

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,315 1,947 Updated Nov 1, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,159 1,388 Updated Nov 14, 2025

Enjoy the magic of Diffusion models!

Python 10,829 1,015 Updated Nov 27, 2025

Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System

Python 8,460 2,026 Updated May 13, 2024

DUSt3R: Geometric 3D Vision Made Easy

Python 6,760 713 Updated Sep 24, 2025

Graph Convolutional Networks in PyTorch

Python 5,387 1,228 Updated Sep 20, 2020

Count the MACs / FLOPs of your PyTorch model.

Python 5,059 532 Updated Jul 8, 2024

Official implementations for paper: Anydoor: zero-shot object-level image customization

Python 4,195 371 Updated Apr 8, 2024

The project is an official implement of our ECCV2018 paper "Simple Baselines for Human Pose Estimation and Tracking(https://arxiv.org/abs/1804.06208)"

Python 2,999 601 Updated Nov 28, 2022

PyTorch version of the paper 'Enhanced Deep Residual Networks for Single Image Super-Resolution' (CVPRW 2017)

Python 2,588 687 Updated Jan 3, 2023

label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful

Python 2,257 373 Updated Oct 17, 2024

Fast and accurate human pose estimation in PyTorch. Contains implementation of "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose" paper.

Python 2,218 491 Updated Apr 30, 2024

"Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement" (ICCV 2023) & (NTIRE 2024 Runner-Up)

Python 1,314 107 Updated Oct 10, 2025

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Python 1,301 88 Updated Jun 15, 2024

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,288 65 Updated Oct 16, 2025

A toolbox for spectral compressive imaging reconstruction including MST (CVPR 2022), CST (ECCV 2022), DAUHST (NeurIPS 2022), BiSCI (NeurIPS 2023), HDNet (CVPR 2022), MST++ (CVPRW 2022), etc.

Python 1,111 85 Updated Oct 10, 2025

ICCV 2023-2025 Papers: Discover cutting-edge research from ICCV 2023-25, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included.…

Python 963 44 Updated Nov 7, 2025

Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]

Python 927 102 Updated Oct 10, 2025

Fast Diffusion Models with Transformers

Python 903 118 Updated Aug 17, 2025

Learning Image-adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-time

Python 887 139 Updated Dec 16, 2023

[NeurIPS 24] PromptFix: You Prompt and We Fix the Photo

Python 886 56 Updated Oct 4, 2024

整理 pytorch 单机多 GPU 训练方法与原理

Python 852 88 Updated Nov 23, 2021

Official implementation of "LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching"

Python 818 39 Updated May 24, 2024

[NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"

Python 782 23 Updated Mar 3, 2025

[CVPR 2023] Official implementation of the paper "One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer"

Python 759 62 Updated Aug 26, 2024

A Guidance on PyTorch Coding Style Based on Kaggle Dogs vs. Cats

Python 749 225 Updated Jul 13, 2020

"Structure-Aware Sparse-View X-ray 3D Reconstruction" (CVPR 2024) - A Toolbox for CT reconstruction and X-ray Novel View Synthesis

Python 749 35 Updated Oct 10, 2025

LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking

Python 732 140 Updated May 7, 2020

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction (ICCV 2025)

Python 645 32 Updated Nov 24, 2025

A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.

Python 632 57 Updated Oct 14, 2025
Next