Highlights
- Pro
Stars
my own studied materials and scripts
PyTorch implementation for paper "CUDA-GHR: Controllable Unsupervised Domain Adaptation for Gaze and Head Redirection"
The official repository of SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization
[ICCV2025] 3D Gaussian Map with Open-Set Semantic Grouping for Vision-Language Navigation
Official implementation of "Chemical knowledge-informed framework for privacy-aware retrosynthesis learning".
(ICLR25 Oral) Do as We Do, Not as You Think: the Conformity of Large Language Models
[NeurIPS'24] Scene Graph Generation with Role-Playing Large Language Models
The official implementation of "Human101: Training 100+FPS Human Gaussians in 100s from 1 View".
[NeurIPS2023] Neural-Logic Human-Object Interaction Detection
[ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation
👀 | MobileGaze: Real-Time Gaze Estimation models using ResNet 18/34/50, MobileNet v2 and MobileOne s0-s4 | In PyTorch >> ONNX Runtime Inference
PyTorch implementation for Contrastive Representation Learning for Gaze Estimation
3QFP: Efficient neural implicit surface reconstruction using Tri-Quadtrees and Fourier feature Positional encoding
This is the official implementation of "Interpretable3D: An Ad-Hoc Interpretable Classifier for 3D Point Clouds" (Accepted at AAAI 2024).
This is the official implementation of "LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels" (Accepted at CVPR 2024).
This is the official implementation of "Clustering based Point Cloud Representation Learning for 3D Analysis" (Accepted at ICCV 2023).
This is the official implementation of "Clustering Propagation for Universal Medical Image Segmentation" (Accepted at CVPR 2024).
[CVPR24] Volumetric Environment Representation for Vision-Language Navigation
[CVPR'24] Neural Clustering based Visual Representation Learning
An extension of the CVPR paper (The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation)
[CVPR'2022 Oral] The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation
(ICCV23 Oral) LOGICSEG: Parsing Visual Semantics with Neural Logic Learning and Reasoning
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models