-
University of Chinese Academy of Sciences
- Beijing
-
03:29
(UTC +08:00) - https://baizey.rvosuke.com
- https://orcid.org/0009-0003-8776-6980
Highlights
- Pro
Stars
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
Pointcept: Perceive the world with sparse points, a codebase for point cloud perception research. Latest works: Concerto (NeurIPS'25), Sonata (CVPR'25 Highlight), PTv3 (CVPR'24 Oral)
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
Ultra Fast Structure-aware Deep Lane Detection (ECCV 2020)
[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
[CVPR 2024 Oral, Best Paper Runner-Up] Code for "pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction" by David Charatan, Sizhe Lester Li, Andrea Tagliasacch…
🌊 [ECCV'24 Oral] MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
[CVPR'25] DepthSplat: Connecting Gaussian Splatting and Depth
Official repository for Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs
【CVPR 2025 Highlight】MonSter: Marry Monodepth to Stereo Unleashes Power
DAGs with NO TEARS: Continuous Optimization for Structure Learning
[CVPR 2024] Code release for TransNeXt model
Python package for causal discovery based on LiNGAM.
Official implementation for "Stable Flow: Vital Layers for Training-Free Image Editing" [CVPR 2025]
Official repository of HUGS: Human Gaussian Splats (CVPR 2024)
[ICLR'26] The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"
Official repository of CVPR 2024 paper "EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image Segmentation"
[CVPR'2024] Official implementation of the paper "ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation"
[CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow
[3DV 2026] Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting
[CVPR2025] Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation