- Westlake University
- China
- yliu-cs.github.io
-
MMaDA-VLA Public
[arXiv'26] MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation
-
calvin Public
Forked from mees/calvin
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
Python · MIT License · Updated Dec 29, 2025
-
SSR Public
[NeurIPS'25] SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning
-
LIBERO Public
Forked from Lifelong-Robot-Learning/LIBERO
Benchmarking Knowledge Transfer in Lifelong Robot Learning
Jupyter Notebook · MIT License · Updated Sep 30, 2025
-
PiTe Public
[ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model
-
ml-depth-pro Public
Forked from apple/ml-depth-pro
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
Python · Other · Updated Jan 22, 2025
-
numerize Public
Forked from davidsa03/numerize
Convert large numbers into readable numbers for humans.
Python · MIT License · Updated Dec 17, 2024
-
CVLA Public
[ICMR'24] Comment-aided Video-Language Alignment via Contrastive Pre-training for Short-form Video Humor Detection
-
CMHP Public
[ECAI'23] Comment-aware Multi-modal Heterogeneous Pre-training for Humor Detection in Short-form Videos
-
CS2-Config-Presets Public
Forked from Purple-CSGO/CS2-Config-Presets
CFG Presets for many scenarios in Counter-Strike 2
-
CSGO-Config-Presets Public
Forked from Purple-CSGO/CSGO-Config-Presets
Presets of Config files for many scenarios in CS:GO
Squirrel · GNU General Public License v3.0 · Updated Oct 4, 2023
-
ACM-Code-Library Public
Forked from under-the-keyboard/ACM-Code-Library
ACM Team Under The Keyboard's Code Library