- Vancouver, Canada
- gguz(at)cs.ubc.ca
Stars
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
A Framework for Benchmarking and Improving Coding Agents for Robot Manipulation
[ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation
Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
Official repository of LIBERO-plus, a generalized benchmark for in-depth robustness analysis of vision-language-action models.
A convex-set-based approach to manipulator trajectory planning
Paper list in the survey: A Survey on Vision-Language-Action Models: An Action Tokenization Perspective
[RSS 2025] CLIP-RT : Learning Language-Conditioned Robotic Policies from Natural Language Supervision
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Repository for the EMNLP'24 paper "Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models". Chiyah-Garcia et al. https://arxiv.org/abs/2409.14247
EARL: Environment for Autonomous Reinforcement Learning
A curated list of awesome NVIDIA Issac Gym frameworks, papers, software, and resources
Setup for Octo and some experiments with the model
Curated repository of papers on integrating reinforcement learning with generative AI models in robotics, featuring categorized Excel summaries of key analysis metrics like frameworks, applications…
An example RLDS dataset builder for X-embodiment dataset conversion.
Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"
🔥Highlighting the top ML papers every week.
https://arxiv.org/abs/2312.10807
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources