A set of ready-to-use Agent Skills for research, science, engineering, analysis, finance, and writing.

Python 18,238 2,023 Updated Apr 13, 2026

[ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

C 220 29 Updated Mar 26, 2025

ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks

C 507 103 Updated Feb 5, 2026

Official Implementation of ReALFRED (ECCV'24)

Python 45 2 Updated Oct 11, 2024

[ICCV 25] Official repository of "Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dialogues"

Python 28 3 Updated Apr 1, 2026
Python 250 13 Updated Aug 6, 2025

[NeurIPS'2025] "OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis"

Python 28 Updated Dec 4, 2025

This is the official repository of the paper "Towards Physically Executable 3D Gaussian for Embodied Navigation".

Python 168 6 Updated Dec 17, 2025

PyTorch code for NeurIPS-20 Paper "Object Goal Navigation using Goal-Oriented Semantic Exploration"

Python 447 71 Updated Jul 20, 2023

Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks

Python 192 18 Updated Apr 9, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 76,293 15,488 Updated Apr 13, 2026

[ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation

Python 113 5 Updated Jan 27, 2026

[IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation

Python 161 5 Updated Mar 24, 2026

Towards Large Multimodal Models as Visual Foundation Agents

Python 263 11 Updated Apr 24, 2025

3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

Python 402 15 Updated Dec 22, 2025

Nav-R1: Reasoning and Navigation in Embodied Scenes

Python 122 2 Updated Oct 31, 2025

Comprehensive guide for using Docker containers on Euler cluster at ETH Zurich

5 Updated Oct 2, 2025

Clarity: A Minimalist Website Template for AI Research

CSS 204 33 Updated Mar 11, 2026
Python 425 16 Updated Jul 29, 2024

Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"

Python 341 18 Updated Jan 6, 2026

SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World

Python 152 18 Updated Nov 4, 2024

[RSS'25] This repository is the implementation of "NaVILA: Legged Robot Vision-Language-Action Model for Navigation"

Python 577 49 Updated Aug 20, 2025

Mobile manipulation research tools for roboticists

Python 1,198 151 Updated Jun 8, 2024

Official repo for paper "Distilling LLM Prior to Flow Model for Generalizable Agent’s Imagination in Object Goal Navigation".

Python 10 1 Updated Nov 25, 2025

RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots

Python 1,323 184 Updated Mar 18, 2026

LOVMM

Python 11 Updated May 19, 2025

[ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation

Python 214 5 Updated Jul 2, 2025

Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)

Python 282 27 Updated Mar 6, 2025