- ETH Zürich
- Zurich, Switzerland
- https://scholar.google.com/citations?user=RM-ZTq0AAAAJ&hl=en
- in/danilodjordjevic98
Stars
A set of ready-to-use Agent Skills for research, science, engineering, analysis, finance and writing.
[ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
[ICCV 25] Official repository of "Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dialogues"
[NeurIPS'2025] "OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis"
This is the official repository of the paper "Towards Physically Executable 3D Gaussian for Embodied Navigation".
PyTorch code for NeurIPS-20 Paper "Object Goal Navigation using Goal-Oriented Semantic Exploration"
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks
A high-throughput and memory-efficient inference and serving engine for LLMs
[ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation
[IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation
Towards Large Multimodal Models as Visual Foundation Agents
3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding
Nav-R1: Reasoning and Navigation in Embodied Scenes
Comprehensive guide for using Docker containers on the Euler cluster at ETH Zurich
Clarity: A Minimalist Website Template for AI Research
Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"
SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
[RSS'25] This repository is the implementation of "NaVILA: Legged Robot Vision-Language-Action Model for Navigation"
Mobile manipulation research tools for roboticists
Official repo for paper "Distilling LLM Prior to Flow Model for Generalizable Agent’s Imagination in Object Goal Navigation".
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
[ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation
Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)