Skip to content
View WQYuan's full-sized avatar

Block or report WQYuan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".

Python 245 20 Updated Jun 17, 2025

Visualization of dataset splits for surgical phase and instrument recognition

TypeScript 8 Updated Jun 5, 2024

Official Implementation of Dyn-O: Building Structured World Models with Object-Centric Representations (NeurIPS 2025)

Python 8 1 Updated Feb 3, 2026

Medical 3D Vision-language alignment for abnormality zero-shot diagnosis

Python 8 Updated Oct 28, 2025

OpenManus is an open-source initiative to replicate the capabilities of the Manus AI agent, a state-of-the-art general-purpose AI developed by Monica, which excels in autonomously executing complex…

Python 908 211 Updated Jun 26, 2025

This is the project for 'USG'.

CSS 39 Updated Apr 7, 2025

Vector-Quantized Vision Foundation Models for Object-Centric Learning, ACM MM 2025.

Python 16 2 Updated May 30, 2026

[CVPR 2025] SAM-I2V

Jupyter Notebook 38 1 Updated Jan 2, 2026

[CVPR 2025] Official Pytorch Code for Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation

Python 50 3 Updated Mar 27, 2025

Official implementation of: "PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning" by Villar-Corrales & Behnke. ICML 2025

Python 22 2 Updated Apr 1, 2026

A vision-language model for recognizing surgical objects in surgical images and videos.

Python 8 Updated Oct 3, 2025

The official code for TMI2025 work "Instrument-Tissue-Guided Surgical Action Triplet Detection via Textual-Temporal Trail Exploration".

Python 9 Updated Jan 10, 2026

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 10,708 877 Updated Jun 15, 2026

[MICCAI 2025 Young Scientist Award] Official implementation of "Learning Concept-Driven Logical Rules for Interpretable and Generalizable Medical Image Classification"

Python 14 1 Updated Aug 29, 2025

This is anonymous repository for submitting our work to a conference

Jupyter Notebook 14 Updated Dec 17, 2024

[CVPR'2025] EntitySAM: Segment Everything in Video

Python 65 7 Updated Jul 13, 2025

RelTR: Relation Transformer for Scene Graph Generation: https://arxiv.org/abs/2201.11460v2

Python 312 58 Updated Aug 20, 2024

[ICLR'24] Learning to Compose: Improving Object Centric Learning by Injecting Compositionality

Python 8 1 Updated Nov 12, 2025
Python 23 2 Updated May 31, 2026
Python 21 2 Updated Jun 17, 2025

Code for this paper "SimSMoE: Toward Efficient Training Mixture of Experts via Solving Representational Collapse".

Python 6 Updated May 28, 2025

"Object-Region Video Transformers”, Herzig et al., CVPR 2022

Python 50 12 Updated Jul 6, 2022

Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".

Python 55 3 Updated Oct 21, 2025

Code for MICCAI2025 paper "Next slot prediction for unsupervised object discovery"

Python 6 2 Updated Mar 10, 2026

Official implementation of "Exploring Temporally-Aware Features for Point Tracking" (CVPR 2025)

Python 101 4 Updated Apr 5, 2025
CSS 62 3 Updated Apr 21, 2026

Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"

Python 50 3 Updated Jun 4, 2025

[CVPR2025] Official implementation of RAM

Python 29 1 Updated Nov 4, 2025

Official implementation of Pix2SG, the first location-free scene graph generation method, as well as the corresponding heuristic tree search-based evaluation implemented in C++.

Python 12 Updated Sep 21, 2025

[ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"

Python 155 7 Updated Sep 10, 2024
Next