-
HKUST
- Hong Kong , China
-
03:18
(UTC +08:00) - https://github.com/Atrewin
- https://jhuiye.com
Highlights
- Pro
Stars
[ICML 2026] This repo is the official implementation of "LangForce : Bayesian Decomposition of Vision Language Action Models via Latent Action Queries"
Team Comet's 2025 BEHAVIOR Challenge Codebase
1st place solution of 2025 BEHAVIOR Challenge
This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing and Generation (CVPR2026 Highlight)''
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation
SePer is an accurate / fast / free-of-API metric to measure document quality via information gain
[NeurIPS 2025] Efficient Reasoning Vision Language Models
TStar is a unified temporal search framework for long-form video question answering
Code for the EMNLP 2024 paper "Improve Dense Passage Retrieval with Entailment Tuning"
This is the official code repository for the paper 'Improving Gloss-free Sign Language Translation by Reducing Representation Density'.
API to run VirtualHome, a Multi-Agent Household Simulator
An Examination of the Compositionality of Large Generative Vision-Language Models