- Canberra, Australia
- hou-yz.github.io
- https://orcid.org/0000-0002-6916-4789
Stars
Turn any AI agent into an AI Scientist. The #1 Agent Skills library for science, used by 160,000+ scientists worldwide. 140 ready-to-use skills plus 100+ scientific databases covering biology, chem…
Automatically builds GenP executables from source
Latex template for ACM conference/Journal rebuttal.
A repository for Flclash scripts
An agentic skills framework & software development methodology that works.
Academic Research Skills for Claude Code: research → write → review → revise → finalize
[CVPR 2026] Official implementation of "ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models"
Awesome Unified Multimodal Models
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Official implementation of Continuous 3D Perception Model with Persistent State
[CVPR 2024 Highlight] Map-Relative Pose Regression for Visual Re-Localization
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models
This repo will contain scripts that automatically export paper information from openreview to ACM
This checklist is designed to help you systematically prepare and polish academic papers for top conferences and journals (e.g., ICML, NeurIPS, CVPR). It incorporates widely recommended best practi…
Public facing notes page
Build your own visual reasoning model
Our library for RL environments + evals