WQYuan

WQYuan

2 followers · 4 following

Stars

Little-Podi / AdaWorld

[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".

Python 245 20 Updated Jun 17, 2025

Cardio-AI / endovis-ml

Visualization of dataset splits for surgical phase and instrument recognition

TypeScript 8 Updated Jun 5, 2024

wangzizhao / dyn-O

Official Implementation of Dyn-O: Building Structured World Models with Object-Centric Representations (NeurIPS 2025)

Python 8 1 Updated Feb 3, 2026

laihaoran / BrgSA

Medical 3D Vision-language alignment for abnormality zero-shot diagnosis

Python 8 Updated Oct 28, 2025

henryalps / OpenManus

OpenManus is an open-source initiative to replicate the capabilities of the Manus AI agent, a state-of-the-art general-purpose AI developed by Monica, which excels in autonomously executing complex…

Python 908 211 Updated Jun 26, 2025

ChocoWu / USG

This is the project for 'USG'.

CSS 39 Updated Apr 7, 2025

Genera1Z / VQ-VFM-OCL

Vector-Quantized Vision Foundation Models for Object-Centric Learning, ACM MM 2025.

Python 16 2 Updated May 30, 2026

showlab / SAM-I2V

[CVPR 2025] SAM-I2V

Jupyter Notebook 38 1 Updated Jan 2, 2026

MICV-yonsei / CASS

[CVPR 2025] Official Pytorch Code for Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation

Python 50 3 Updated Mar 27, 2025

angelvillar96 / PlaySlot

Official implementation of: "PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning" by Villar-Corrales & Behnke. ICML 2025

Python 22 2 Updated Apr 1, 2026

ntlm1686 / raso

A vision-language model for recognizing surgical objects in surgical images and videos.

Python 8 Updated Oct 3, 2025

PJLallen / ITG-Trip

The official code for TMI2025 work "Instrument-Tissue-Guided Surgical Action Triplet Detection via Textual-Temporal Trail Exploration".

Python 9 Updated Jan 10, 2026

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 10,708 877 Updated Jun 15, 2026

obiyoag / crl

[MICCAI 2025 Young Scientist Award] Official implementation of "Learning Concept-Driven Logical Rules for Interpretable and Generalizable Medical Image Classification"

Python 14 1 Updated Aug 29, 2025

KanghoonYoon / torch-rasgg

This is anonymous repository for submitting our work to a conference

Jupyter Notebook 14 Updated Dec 17, 2024

ymq2017 / entitysam

[CVPR'2025] EntitySAM: Segment Everything in Video

Python 65 7 Updated Jul 13, 2025

yrcong / RelTR

RelTR: Relation Transformer for Scene Graph Generation: https://arxiv.org/abs/2201.11460v2

Python 312 58 Updated Aug 20, 2024

whieya / Learning-to-compose

[ICLR'24] Learning to Compose: Improving Object Centric Learning by Injecting Compositionality

Python 8 1 Updated Nov 12, 2025

chen-yiliang / ProstaTD

Python 23 2 Updated May 31, 2026

dido1998 / CTRL-O

Python 21 2 Updated Jun 17, 2025

giangdip2410 / SimSMoE

Code for this paper "SimSMoE: Toward Efficient Training Mixture of Experts via Solving Representational Collapse".

Python 6 Updated May 28, 2025

eladb3 / ORViT

"Object-Region Video Transformers”, Herzig et al., CVPR 2022

Python 50 12 Updated Jul 6, 2022

mlvlab / vid-TLDR

Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".

Python 55 3 Updated Oct 21, 2025

PCASOlab / Xslot

Code for MICCAI2025 paper "Next slot prediction for unsupervised object discovery"

Python 6 2 Updated Mar 10, 2026

cvlab-kaist / Chrono

Official implementation of "Exploring Temporally-Aware Features for Point Tracking" (CVPR 2025)

Python 101 4 Updated Apr 5, 2025

jinlab-imvr / SurgVLM

CSS 62 3 Updated Apr 21, 2026

isyangshu / SurgVISTA

Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"

Python 50 3 Updated Jun 4, 2025

EricTan7 / RAM

[CVPR2025] Official implementation of RAM

Python 29 1 Updated Nov 4, 2025

egeozsoy / LF-SGG

Official implementation of Pix2SG, the first location-free scene graph generation method, as well as the corresponding heuristic tree search-based evaluation implemented in C++.

Python 12 Updated Sep 21, 2025

TencentARC / ST-LLM

[ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"

Python 155 7 Updated Sep 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly