W-Ted

Yuxin WANG W-Ted

Ph.D. student at HKUST

66 followers · 168 following

https://w-ted.github.io/

Achievements

Stars

hanxunyu / DepthVLM

🔥 Official code repository for "Unlocking Dense Metric Depth Estimation in VLMs"

Python 128 6 Updated May 21, 2026

tencent-ailab / Penguin-VL

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders [Technical Report]

Jupyter Notebook 193 10 Updated Mar 30, 2026

yangcaoai / VGGT-Det-CVPR2026

Official code for CVPR 2026 paper: VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection

Python 131 4 Updated Apr 14, 2026

MCG-NJU / VideoChat-Online

[CVPR 2025] Online Video Understanding: OVBench and VideoChat-Online

Python 95 6 Updated Oct 7, 2025

ShaelynZ / fhavatar

Official code for "Generalizable and Animatable 3D Full-Head Gaussian Avatar from a Single Image"

16 Updated Jan 22, 2026

showlab / ShowUI-Aloha

Human-taught Computer-use Agent Designed for Real Windows and MacOS Desktops.

Python 310 37 Updated Jan 20, 2026

zhangzaibin / spagent

SPAgent, a foundation agent for understanding, reasoning over, and operating within the physical and spatial world.

Python 192 30 Updated May 20, 2026

bytedance / UI-TARS-desktop

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 36,413 3,671 Updated May 18, 2026

TencentCloudADP / youtu-tip

Youtu-Tip: Tap for Intelligence, Keep on Device.

Python 591 66 Updated Feb 27, 2026

JIA-Lab-research / RePlan

RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing

Python 65 3 Updated Mar 19, 2026

Computer-use-agents / dart-gui

DART-GUI: Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation

Python 93 6 Updated Feb 26, 2026

lifuguan / GGRt_official

[ECCV 2024] GGRt: Towards Pose-free Generalizable 3D Gaussian Splatting in Real-time

Python 105 4 Updated Apr 3, 2025

gyy456 / CityGS-X

[ICCV 2025] CityGS-X : A Scalable Architecture for Efficient and Geometrically Accurate Large-Scale Scene Reconstruction

Python 167 15 Updated May 15, 2026

meituan / EvoCUA

EvoCUA: Evolving Computer Use Agent

Python 325 24 Updated Mar 31, 2026

NVlabs / GDPO

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 472 32 Updated May 20, 2026

UVA-Computer-Vision-Lab / LabelAny3D

[NeurIPS 2025] LabelAny3D: Label Any Object 3D in the Wild

Python 130 9 Updated Jan 6, 2026

vivo / DiMo-GUI

[EMNLP 2025]Repository for paper "DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning"

Python 30 3 Updated Jul 2, 2025

Zhoues / RoboTracer

Official implementation of "RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics"

Python 74 2 Updated Jan 19, 2026

camel-ai / camel

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 17,192 1,951 Updated Jun 14, 2026

Zhoues / RoboRefer

[NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"

Python 263 11 Updated Dec 16, 2025

wzpscott / FlashVGGT

Accelerate VGGT with efficient desciptor-based global attention

Python 88 2 Updated Jun 3, 2026

LiuJF1226 / Mono4DGS-HDR

[ICLR 2026] Mono4DGS-HDR: High Dynamic Range 4D Gaussian Splatting from Alternating-exposure Monocular Videos

28 1 Updated May 29, 2026

hkdsc / fullpart

A part-based 3D generation framework & the largest and most comprehensively annotated 3D part dataset.

Jupyter Notebook 141 9 Updated Dec 15, 2025

zai-org / CogCoM

Jupyter Notebook 223 14 Updated Jul 5, 2024

chenhaomingbob / CSC

[CVPR 2024] This is official implementation of our CVPR 2024 paper "Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception" https://arxiv.org/abs/2405.07201

Python 17 Updated Jun 11, 2024

longvideoagent / LongVideoAgent

Python 116 5 Updated Apr 8, 2026

chengzhag / PanSplat

🍳 [CVPR'25] PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting

Python 227 15 Updated Apr 19, 2026

weikaih04 / Synthetic-Detection-Segmentation-Grounding-Data

[CVPR 2026] An accurate and dense-annotated synthetic dataset for training SOTA detectors / segmentors / Grounding-VLMs.

Python 47 Updated Feb 23, 2026

lpiccinelli-eth / UniDepth

Universal Monocular Metric Depth Estimation

Python 1,212 113 Updated May 18, 2025

cvg / 3D-MOOD

[ICCV'25] 3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection

Python 122 8 Updated Oct 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yuxin WANG W-Ted

Achievements

Achievements

Block or report W-Ted

Stars

hanxunyu / DepthVLM

tencent-ailab / Penguin-VL

yangcaoai / VGGT-Det-CVPR2026

MCG-NJU / VideoChat-Online

ShaelynZ / fhavatar

showlab / ShowUI-Aloha

zhangzaibin / spagent

bytedance / UI-TARS-desktop

TencentCloudADP / youtu-tip

JIA-Lab-research / RePlan

Computer-use-agents / dart-gui

lifuguan / GGRt_official

gyy456 / CityGS-X

meituan / EvoCUA

NVlabs / GDPO

UVA-Computer-Vision-Lab / LabelAny3D

vivo / DiMo-GUI

Zhoues / RoboTracer

camel-ai / camel

Zhoues / RoboRefer

wzpscott / FlashVGGT

LiuJF1226 / Mono4DGS-HDR

hkdsc / fullpart

zai-org / CogCoM

chenhaomingbob / CSC

longvideoagent / LongVideoAgent

chengzhag / PanSplat

weikaih04 / Synthetic-Detection-Segmentation-Grounding-Data

lpiccinelli-eth / UniDepth

cvg / 3D-MOOD