The first CVPR workshop on

3D Vision Language Models (VLMs) for Robotic Manipulation: Opportunities and Challenges

June 11, 2025, Nashville, TN. Location: TBA

Introduction

The intersection of 3D Vision-and-Language models (3D VLMs) and robotics presents a new frontier, blending spatial understanding with contextual reasoning. The Robo-3DVLM workshop explores the opportunities and challenges of integrating these technologies to enhance robot perception, decision-making, and interaction with the real world. As robots operate in increasingly complex environments, bridging the gap between 3D spatial reasoning and language understanding becomes critical. Key questions at the heart of this workshop include:

By addressing these questions, the workshop aims to drive conversations around the utility of 3D in robotic vision, the role of language in perception, and the limitations imposed by current data and hardware constraints. Through invited talks and interactive sessions, we aim to unite researchers from diverse disciplines to push the boundaries of multimodal learning in robotics, setting the stage for the next generation of intelligent systems.

Call for Papers

We are excited to announce the Call for Papers for the Robo-3DVLM workshop. We invite original contributions presenting novel ideas, research, and applications relevant to the workshop’s theme.

Important Dates

Event                 Date
Call for Papers       January 30th, 2025
Submission Deadline   April 15th, 2025, 23:59 PST
Notification          May 10th, 2025
Camera-Ready          May 25th, 2025

Submission Guidelines

Paper topics

A non-exhaustive list of relevant topics:

Workshop Schedule (Tentative)

Start Time (PDT)   End Time (PDT)   Event
9:00 AM            9:10 AM          Opening remarks
9:10 AM            9:45 AM          Hao Su: Talk Title (TBD)
9:45 AM            10:20 AM         Chelsea Finn: Pretraining and Posttraining Robotic Foundation Models
10:20 AM           10:55 AM         Angel Chang: Building vision-language maps for embodied AI
10:55 AM           11:10 AM         Coffee Break
11:10 AM           11:45 AM         Yunzhu Li: Foundation Models for Structured Scene Modeling in Robotic Manipulation
11:45 AM           12:20 PM         Katerina Fragkiadaki: Talk Title (TBD)
12:20 PM           1:30 PM          Lunch
1:30 PM            2:00 PM          Poster Session
2:00 PM            2:35 PM          Ranjay Krishna: Talk Title (TBD)
2:35 PM            3:10 PM          Dieter Fox: Talk Title (TBD)
3:10 PM            3:25 PM          Coffee Break
3:25 PM            4:00 PM          Panel Discussion (TBD)
4:00 PM            4:45 PM          Spotlight Paper Talks (5 min talk each / 2 min Q&A)
4:45 PM            5:00 PM          Closing Remarks and Paper Awards
For inquiries, contact us at: robo-3dvlm@googlegroups.com